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(ii) TITLE OF INVENTION: GENE REGULATING AUREOBASIDIN SENSITIVITY 
(AS AMENDED) 

(iii) NUMBER OF SEQUENCES: 50 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: WENDEROTH, LIND & PONACK, L.L.P. 

(B) STREET: 2033 K Street, N.W., #800 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S.A. 

(F) ZIP: 20006 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 mb 

(B) computer: IBM Compatible 

(C) OPERATING SYSTEM: MS-DOS 

(d) software: wordperfect 5.1 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 10/777,195 

(B) FILING DATE: February 13, 2004 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: 09/368,431 

(B) FILING DATE: August 5, 1999 

(viii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/716,873 

(B) FILING DATE: September 20, 1996 

(ix) ATTORNEY/ AGENT INFORMATION: 



(A) name: warren M. cheek, Jr. 

(B) REGISTRATION NUMBER: 33,367 

(C) REFERENCE/DOCKET NUMBER: 2003-1772/WMC/00001 
(X) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-721-8200 

(B) TELEFAX: 202-721-8250 

(C) TELEX: 
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(ii) molecule type: Genomic dna 



agatctgtgg 


cttccggttg 


gctacttgta 


accaactgat 


ggtcagatgg 


atctgccgtc 


60 


tgttttgatt 


tgaattttcc 


ctgctcattc 


tgattctgtg 


agaggctgca 


ttcattatca 


120 


catctcatac 


ccggcgcctg 


cgacttcggt 


cacctctgcg 


gtctggcggt 


taccggggtc 


180 


cgtctgagac 


tcgtcagtca 


gccattcgag 


tatgcgaact 


ctgactttgc 


tcacctaaga 


240 


gtttgcacga 


gatgccgaaa 


tcctcctcga 


gtagagtttg 


caaggcttga 


accttggtcc 


300 


ttgaagcccg 


aaagtggctc 


agtagtggga 


tcgatagtct 


ggttgttgaa 


gattttctcc 


360 


tccaccttac 


ctatggccgc 


tggccttctc 


cacctttcag 


gctttcaggc 


accctcggct 


420 


cggattctgt 


atcgtccggt 


accgaagcta 


gtcctagcta 


gtcaaagcta 


gtccaagcta 


480 


gtctcgtcaa 


ggtttggcgc 


agcgcggttc 


cgtgtaaagt 


acaaatttga 


aatacgaata 


540 


cgcagtactc 


gcagccggca 


cttccgctca 


gcccaggctc 


agaggctaag 


ggtgttggcg 


600 


cttcctcatc 


atcttcttct 


cgtcgacctt 


ttcctctttc 


tctccctatc 


ggtgcttctc 


660 


tccaacctca 


ttctcagtcg 


ttcgcccatc 


aggtttatac 


tccggctccg 


tggccatctg 


720 


cctccctcac 


gacctcctcg 


ttccaggttt 


tcctctcgac 


tgctgcgccc 


ttgcacttcg 


780 


ccttgcatca 


gtgaaacccc 


ctgcaacgtg 


acggctcaaa 


gacatcctcg 


tttggccgct 


840 


ggagaccgga 


gcgtgcgctt 


cgtttcgtct 


tcttcgaacc 


gatctcaatt 


tccccgctcg 


900 


ggttgacgcc 


gtcagcaccc 


tgctcgttgc 


ctaacggctt 
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gttattcaag 


accccttttc 


960 



tgccgcttcc 


gcgaccgatt 


tattcgtcgc 


cttccaactc 


ttgtacaatc 


ggggggaaag 


1020 


aaagcagacg 


gagttcgatc 


tggaggaatt 


atagctgagt 


cttgcccgca 


agactcgccg 


1080 


caaccatgaa 


tcaaacactt 


cccacgtgga 


aggaccgcac 


gcagaaccag 


tttggaaagc 


1140 


ttcagatcca 


ggttccatgg 


cggtccatcc 


aactgctcgt 


cccgcatcgc 


atgcggcgga 


1200 


agttaaggtc 


caaattgcgc 


agtagagcgt 


ctcctacctc 


gtcaatagcc 


tctttacaga 


1260 


cgtcgttatc 


gcctgcagac 


acactacgat 


cgctccaaag 


ccaccgatgg 


acggtttacg 


1320 


acttccaata 


tctgcttctg 


ttgatcgtgg 


gcatcttctc 


tttgaccgtt 


atcgagtcgc 


1380 


ccgggccttt 


gggcaaaacg 


gccattttct 


ccatgctcct 


attctctctc 


ctgatcccta 


1440 


tgacccgcca 


gttcttcctc 


ccgtttctgc 


cgattgccgg 


atggcttctg 


tttttctacg 


1500 


cctgccagtg 


agttaaaaac 


aacccgctac 


cagaccccgt 


gcagcagtta 


ctcacatatg 


1560 


caggttcatc 


ccaagcgatt 


ggcgccctgc 


gatttgggtt 


cgtgtcttgc 


ctgcactgga 


1620 


gaatattctc 


tacggcgcaa 


acatcagcaa 


catcctatcc 


gctcaccaga 


acgttgtgct 


1680 


tgacgtgctg 


gcgtggctac 


cctacggtat 


ctgccactat 


ggcgctccgt 


ttgtgtgctc 


1740 


gttgatcatg 


ttcatcttcg 


gtccgcccgg 


cactgttccc 


cttttcgcgc 


gcactttcgg 


1800 


ctatatcagt 


atgactgcgg 


ttactattca 


gctgtttttc 


ccttgctctc 


caccttggta 


1860 


tgagaatcgc 


tatggtctag 


ctccggcaga 


ctactccatc 


caaggtgatc 


ccgcagggct 


1920 


tgcccgcatt 


gacaagcttt 


tcggcatcga 


cctttacacg 


tctgttttcc 


atcagtcgcc 


1980 


tgttgtgttc 


ggcgcttttc 


cgtcgctgca 


tgctgccgac 


tcaaccctgg 


ccgcactttt 


2040 


catgagtcat 


gttttccccc 


gcatgaagcc 


cgtcttcgtg 


acctatactc 


tatggatgtg 


2100 


gtgggcaaca 


atgtacctct 


cacatcacta 


tgcggtcgat 


ttggttgcgg 


gtggtctcct 


2160 


ggccgccatt 


gctttctact 


tcgccaagac 


ccgattcctt 


ccccgtgtcc 


agctcgacaa 


2220 


gaccttccgt 


tgggactacg 


actatgtgga 


attcggcgag 


tctgccctgg 


agtatgggta 


2280 


tggtgcagct 


ggctatgatg 


gagacttcaa 


tctcgacagc 


gatgaatgga 


ctgttggttc 


2340 


ttcatcctcc 


gtctcctcag 


gctccttgag 


tcccgttgac 


gatcattact 


catgggaaac 


2400 


cgaggcactg 


acctccccac 


atactgatat 


tgagtccggc 


aggcatactt 


tcagcccttg 


2460 


agtagccaca 


aaccaaactc 


gatacctgca 


tatagcgatc 


tcgctcctcc 


tccactgcat 


2520 


ctatttacga 


gacggcgtta 


gaacatttca 


cgacattctg 


gctttattgc 


atcgagcaca 


2580 


tttcgacaca 


tatatcttta 


ataccctttc 


ttcggtgtcc 


cagatcatcg 


gttcgacctt 


2640 


aatgtacctc 


ggtccgaatc 


cgcctgggat 


actgtttctc 


tttccgccgc 


acttcactgt 


2700 


acattgcttg 


acattgcgaa 


accgggttgg 


gctcgaacgt 


gggatgggtt 


atcgctcatc 


2760 


gctacacgcc 


gttgctccat 


cataatgtta 


atggacacaa 


tggggctacg 


catcctggtg 


2820 


tttagtcctg 


gaagaccatc 


cgataacccc 


cgtcggtaac 
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actcgcttgt 


ctcgtgtcca 


2880 




cccagacact 


acttcaattc 


tcacttctat 


cgtccgctat 


taccttgacc 


tggtcgaacc 


2940 


catccttatt 


attcgtttcg 


actatgctat 


atatttattt 


ttaccattcg 


tgtcgatcgc 


3000 


tcatactctt 


ggcgcttggg 


actggaagca 


tttatattgg 


aaaaaatcac 


ggaatggggc 


3060 


gccttttctt 


cttgcacttc 


actcgctgtg 


catagacggt 


tttacatttc 


tgctttgcaa 


3120 


tgcatcacga 


actctgcatt 


agcatataga 


aagaggggaa 


ggatggacct 


tcttcttgat 


3180 


tgctcgcatg 


gtttatccat 


tcgctcaaag 


tggattacgt 


ccacatatta 


cccgggggct 


3240 


atacacatgg 


ctactgtgtt 


gctttctgac 


attcgccgga 


cgtgcaaggt 


tgggaggaga 


3300 


gtctgacgct 


gacggggctt 


gttgaaggat 


gttcacgcgt 


cccgatttga 


cccggcttcg 


3360 


actaacctca 


gattctcgac 


ttgttggacg 


gtgacttgac 


ttgcttgcta 


tggtctgacg 


3420 


ctctcacacc 


tacctatcac 


atcctcctca 


cctcacaaat 


tccgctcatg 


gacactatcc 


3480 


tcttcttttc 


gtttcccttg 


gatagtgtgt 


gtgtgtgtgt 


ggttggggca 


aattatccat 


3540 


agcagcagta 


ttattagtta 


taatccggta 


gtgttatgat 


ttatgaaggc 


aacttgtata 


3600 


ctattgccac 


tttgtccata 


tctcttgctt 


gtaatagaac 


tgacatcgcg 


acgcttcggc 


3660 


cacgatgcat 


ataaaaactc 


tagtcaacac 


gatattaaca 


agcgaaacca 


ttacgctgta 


3720 


aactattcag 


gatcgccgcg 


ggcccatctg 


ggacttgact 


gtactaaata 


tgtcctaaag 


3780 


caagcagact 


aaatatttaa 


cgtgggatat 


tattcatata 


cgcatatgta 


tacatagtca 


3840 


taacaagcca 


aggggtgggt 


aggggtgggt 


aattattatt 


ttttttcgtc 


gatacaagta 


3900 


tccatcctta 


aatgtccgtg 


gtctactctt 


cataaatctt 


aacccgctcc 


gcatactcct 


3960 


ttatcctcga 


gacaaaagtg 


tcttcaattt 


catcgccacg 


gccaccagca 


acgcggagga 


4020 


taaggtcgtt 


gagagaggcg 


cggccgagaa 


tgtcgacatg 


gtcgccttca 


tattgttagc 


4080 


atgcgacgtc 


agactggaac 


caggaaggga 


aaggagagag 


gtacctgtat 


ttggaccacc 


4140 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 439 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asn Gin Thr Leu Pro Thr Trp Lys Asp Arg Thr Gin Asn Gin 

4 




15 10 15 

Phe Gly Lys Leu Gin He Gin val Pro Trp Arg Ser lie Gin Leu 

20 25 30 

Leu val Pro His Arg Met Arg Arg Lys Leu Arg Ser Lys Leu Arg 

35 40 45 

Ser Arg Ala Ser Pro Thr ser Ser lie Ala Ser Leu Gin Thr ser 

50 55 60 

Leu Ser Pro Ala Asp Thr Leu Arg Ser Leu Gin Ser His Arg Trp 

65 70 75 

Thr val Tyr Asp Phe Gin Tyr Leu Leu Leu Leu lie val Gly lie 

80 85 90 

Phe ser Leu Thr val lie Glu ser Pro Gly Pro Leu Gly Lys Thr 

95 100 105 

Ala lie Phe Ser Met Leu Leu Phe Ser Leu Leu lie Pro Met Thr 

110 115 120 

Arg Gin Phe Phe Leu Pro Phe Leu Pro lie Ala Gly Trp Leu Leu 

125 130 135 

Phe Phe Tyr Ala Cys Gin Phe lie Pro Ser Asp Trp Arg Pro Ala 

140 145 150 

lie Trp val Arg val Leu Pro Ala Leu Glu Asn He Leu Tyr Gly 

155 160 165 

Ala Asn lie Ser Asn lie Leu Ser Ala His Gin Asn val val Leu 

170 175 180 

Asp val Leu Ala Trp Leu Pro Tyr Gly lie Cys His Tyr Gly Ala 

185 190 195 

Pro Phe val Cys Ser Leu lie Met Phe lie Phe Gly Pro Pro Gly 

200 205 210 

Thr val Pro Leu Phe Ala Arg Thr Phe Gly Tyr lie Ser Met Thr 

215 220 225 

Ala Val Thr lie Gin Leu Phe Phe Pro Cys Ser Pro Pro Trp Tyr 

230 235 240 

Glu Asn Arg Tyr Gly Leu Ala Pro Ala Asp Tyr Ser lie Gin Gly 

5 



245 250 255 

Asp Pro Ala Gly Leu Ala Arg lie Asp Lys Leu Phe Gly lie Asp 

260 265 270 

Leu Tyr Thr Ser val Phe His Gin Ser Pro Val Val Phe Gly Ala 

275 280 285 

Phe Pro Ser Leu His Ala Ala Asp Ser Thr Leu Ala Ala Leu Phe 

290 295 300 

Met Ser His val Phe Pro Arg Met Lys Pro val Phe val Thr Tyr 

305 310 315 

Thr Leu Trp Met Trp Trp Ala Thr Met Tyr Leu Ser His His Tyr 

320 325 330 

Ala Val Asp Leu Val Ala Gly Gly Leu Leu Ala Ala lie Ala Phe 

335 340 345 

Tyr Phe Ala Lys Thr Arg Phe Leu Pro Arg val Gin Leu Asp Lys 

350 355 360 

Thr Phe Arg Trp Asp Tyr Asp Tyr Val Glu Phe Gly Glu Ser Ala 

365 370 375 

Leu Glu Tyr Gly Tyr Gly Ala Ala Gly Tyr Asp Gly Asp Phe Asn 

380 385 390 

Leu Asp Ser Asp Glu Trp Thr Val Gly ser Ser Ser Ser val Ser 

395 400 405 

Ser Gly Ser Leu Ser Pro Val Asp Asp His Tyr Ser Trp Glu Thr 

410 415 420 

Glu Ala Leu Thr Ser Pro His Thr Asp lie Glu Ser Gly Arg His 

425 430 435 

Thr Phe Ser Pro 



INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2856 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ggtttatact 


ccggctccgt 


ggccatctgc 


ctccctcacg 


acctcctcgt 


tccaggtttt 


60 


cctctcgact 


gctgcgccct 


tgcacttcgc 


cttgcatcag 


tgaaaccccc 


tgcaacgtga 


120 


cggctcaaag 


acatcctcgt 


ttggccgctg 


gagaccggag 


cgtgcgcttc 


gtttcgtctt 


180 


cttcgaaccg 


atctcaattt 


ccccgctcgg 


gttgacgccg 


tcagcaccct 


gctcgttgcc 


240 


taacggcttg 


ttattcaaga 


ccccttttct 


gccgcttccg 


cgaccgattt 


attcgtcgcc 


300 


ttccaactct 


tgtacaatcg 


gggggaaaga 


aagcagacgg 


agttcgatct 


ggaggaatta 


360 


tagctgagtc 


ttgcccgcaa 


gactcgccgc 


aaccatgaat 


caaacacttc 


ccacgtggaa 


420 


ggaccgcacg 


cagaaccagt 


ttggaaagct 


tcagatccag 


gttccatggc 


ggtccatcca 


480 


actgctcgtc 


ccgcatcgca 


tgcggcggaa 


gttaaggtcc 


aaattgcgca 


gtagagcgtc 


540 


tcctacctcg 


tcaatagcct 


ctttacagac 


gtcgttatcg 


cctgcagaca 


cactacgatc 


600 


gctccaaagc 


caccgatgga 


cggtttacga 


cttccaatat 


ctgcttctgt 


tgatcgtggg 


660 


catcttctct 


ttgaccgtta 


tcgagtcgcc 


cgggcctttg 


ggcaaaacgg 


ccattttctc 


720 


catgctccta 


ttctctctcc 


tgatccctat 


gacccgccag 


ttcttcctcc 


cgtttctgcc 


780 


gattgccgga 


tggcttctgt 


ttttctacgc 


ctgccagttc 


atcccaagcg 


attggcgccc 


840 


tgcgatttgg 


gttcgtgtct 


tgcctgcact 


ggagaatatt 


ctctacggcg 


caaacatcag 


900 


caacatccta 


tccgctcacc 


agaacgttgt 


gcttgacgtg 


ctggcgtggc 


taccctacgg 


960 


tatctgccac 


tatggcgctc 


cgtttgtgtg 


ctcgttgatc 


atgttcatct 


tcggtccgcc 


1020 


cggcactgtt 


ccccttttcg 


cgcgcacttt 


cggctatatc 


agtatgactg 


cggttactat 


1080 


tcagctgttt 


ttcccttgct 


ctccaccttg 


gtatgagaat 


cgctatggtc 


tagctccggc 


1140 


agactactcc 


atccaaggtg 


atcccgcagg 


gcttgcccgc 


attgacaagc 


ttttcggcat 


1200 


cgacctttac 


acgtctgttt 


tccatcagtc 


gcctgttgtg 


ttcggcgctt 


ttccgtcgct 


1260 


gcatgctgcc 


gactcaaccc 


tggccgcact 


tttcatgagt 


catgttttcc 


cccgcatgaa 


1320 


gcccgtcttc 


gtgacctata 


ctctatggat 


gtggtgggca 


acaatgtacc 


tctcacatca 


1380 


ctatgcggtc 


gatttggttg 


cgggtggtct 


cctggccgcc 


attgctttct 


acttcgccaa 


1440 


gacccgattc 


cttccccgtg 


tccagctcga 


caagaccttc 


cgttgggact 


acgactatgt 


1500 


ggaattcggc 


gagtctgccc 


tggagtatgg 


gtatggtgca 


gctggctatg 


atggagactt 


1560 


caatctcgac 


agcgatgaat 


ggactgttgg 


ttcttcatcc 
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tccgtctcct 


caggctcctt 


1620 



gagtcccgtt 


gacgatcatt 


actcatggga 


aaccgaggca ctgacctccc 


cacatactga 


1680 


tattgagtcc 


ggcaggcata 


ctttcagccc 


ttgagtagcc acaaaccaaa 


ctcgatacct 


1740 


gcatatagcg 


atctcgctcc 


tcctccactg 


catctattta cgagacggcg 


ttagaacatt 


1800 


tcacgacatt 


ctggctttat 


tgcatcgagc 


acatttcgac acatatatct 


ttaataccct 


1860 


ttcttcggtg 


tcccagatca 


tcggttcgac 


cttaatgtac ctcggtccga 


atccgcctgg 


1920 


gatactgttt 


ctctttccgc 


cgcacttcac 


tgtacattgc ttgacattgc 


gaaaccgggt 


1980 


tgggctcgaa 


cgtgggatgg 


gttatcgctc 


atcgctacac gccgttgctc 


catcataatg 


2040 


ttaatggaca 


caatggggct 


acgcatcctg 


gtgtttagtc ctggaagacc 


atccgataac 


2100 


ccccgtcggt 


aacactcgct 


tgtctcgtgt 


ccacccagac actacttcaa 


ttctcacttc 


2160 


tatcgtccgc 


tattaccttg 


acctggtcga 


acccatcctt attattcgtt 


tcgactatgc 


2220 


tatatattta 


tttttaccat 


tcgtgtcgat 


cgctcatact cttggcgctt 


gggactggaa 


2280 


gcatttatat 


tggaaaaaat 


cacggaatgg 


ggcgcctttt cttcttgcac 


ttcactcgct 


2340 


gtgcatagac 


ggttttacat 


ttctgctttg 


caatgcatca cgaactctgc 


attagcatat 


2400 


agaaagaggg 


gaaggatgga 


ccttcttctt 


gattgctcgc atggtttatc 


cattcgctca 


2460 


aagtggatta 


cgtccacata 


ttacccgggg 


gctatacaca tggctactgt 


gttgctttct 


2520 


gacattcgcc 


ggacgtgcaa 


ggttgggagg 


agagtctgac gctgacgggg 


cttgttgaag 


2580 


gatgttcacg 


cgtcccgatt 


tgacccggct 


tcgactaacc tcagattctc 

J ZJ 


gacttgttgg 

ZJ ZJ ZJ ZJ 


2640 


acggtgactt 


gacttgcttg 


ctatggtctg 


acgctctcac acctacctat 


cacatcctcc 


2700 


tcacctcaca 


aattccgctc 


atggacacta 


tcctcttctt ttcgtttccc 


ttggatagtg 


2760 


tgtgtgtgtg 


tgtggttggg 


gcaaattatc 


catagcagca gtattattag 


ttataatccg 


2820 


gtagtgttat 


gatttatgaa 


ggcaacttgt 


atacta 




2856 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 amino acids 

(B) TYPE: amino acid 

(C) strandedness: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Asn Gin Thr Leu Pro Thr Trp Lys Asp Arg Thr Gin Asn Gin 
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15 10 15 

Phe Gly Lys Leu Gin lie Gin Val Pro Trp Arg Ser lie Gin Leu 

20 25 30 

Leu val Pro His Arg Met Arg Arg Lys Leu Arg Ser Lys Leu Arg 

35 40 45 

Ser Arg Ala Ser Pro Thr ser ser lie Ala ser Leu Gin Thr Ser 

50 55 60 

Leu Ser Pro Ala Asp Thr Leu Arg ser Leu Gin Ser His Arg Trp 

65 70 75 

Thr val Tyr Asp Phe Gin Tyr Leu Leu Leu Leu lie Val Gly lie 

80 85 90 

Phe Ser Leu Thr val lie Glu Ser Pro Gly Pro Leu Gly Lys Thr 

95 100 105 

Ala lie Phe Ser Met Leu Leu Phe Ser Leu Leu lie Pro Met Thr 

110 115 120 

Arg Gin Phe Phe Leu Pro Phe Leu Pro lie Ala Gly Trp Leu Leu 

125 130 135 

Phe Phe Tyr Ala Cys Gin Phe lie Pro Ser Asp Trp Arg Pro Ala 

140 145 150 

lie Trp val Arg val Leu Pro Ala Leu Glu Asn lie Leu Tyr Gly 

155 160 165 

Ala Asn lie Ser Asn lie Leu Ser Ala His Gin Asn val val Leu 

170 175 180 

Asp val Leu Ala Trp Leu Pro Tyr Gly lie Cys His Tyr Gly Ala 

185 190 195 

Pro Phe Val Cys Ser Leu lie Met Phe lie Phe Gly Pro Pro Gly 

200 205 210 

Thr val Pro Leu Phe Ala Arg Thr Phe Gly Tyr lie Ser Met Thr 

215 220 225 

Ala val Thr lie Gin Leu Phe Phe Pro cys Ser Pro Pro Trp Tyr 

230 235 240 

Glu Asn Arg Tyr Gly Leu Ala Pro Ala Asp Tyr Ser lie Gin Gly 

9 



245 250 255 

Asp Pro Ala Gly Leu Ala Arg lie Asp Lys Leu Phe Gly lie Asp 

260 265 270 

Leu Tyr Thr Ser Gly Phe His Gin Ser Pro Val Val Phe Gly Ala 

275 280 285 

Phe Pro Ser Leu His Ala Ala Asp Ser Thr Leu Ala Ala Leu Phe 

290 295 300 

Met Ser His val Phe Pro Arg Met Lys Pro val Phe Val Thr Tyr 

305 310 315 

Thr Leu Trp Met Trp Trp Ala Thr Met Tyr Leu Ser His His Tyr 

320 325 330 

Ala val Asp Leu val Ala Gly Gly Leu Leu Ala Ala He Ala Phe 

335 340 345 

Tyr Phe Ala Lys Thr Arg Phe Leu Pro Arg val Gin Leu Asp Lys 

350 355 360 

Thr Phe Arg Trp Asp Tyr Asp Tyr val Glu Phe Gly Glu Ser Ala 

365 370 375 

Leu Glu Tyr Gly Tyr Gly Ala Ala Gly Tyr Asp Gly Asp Phe Asn 

380 385 390 

Leu Asp Ser Asp Glu Trp Thr val Gly Ser Ser ser ser val Ser 

395 400 405 

Ser Gly Ser Leu Ser Pro Val Asp Asp His Tyr Ser Trp Glu Thr 

410 415 420 

Glu Ala Leu Thr Ser Pro His Thr Asp lie Glu ser Gly Arg His 

425 430 435 

Thr Phe Ser Pro 



INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

10 



(D) TOPOLOGY: linear 
(ii) MOLECULE type: peptide 
(xi) sequence description: seq id no: 5: 

Met Asn Thr Thr Leu Pro Ser Trp Lys Asp Arg Thr Gin Asn Gin 

15 10 15 

Phe Gly Lys Leu Gin lie Gin Val Pro Trp Arg Thr lie Gin Leu 

20 25 30 

Leu Val Pro His Arg Met Arg Arg Lys lie Arg ser Lys Leu Arg 

35 40 45 

Ser Arg lie Ser Pro Thr ser Ser lie Ser Ser Leu Gin Thr Ser 

50 55 60 

Phe Ser Pro Val Asp Thr Leu Arg Ser Leu Gin Ser His Arg Trp 

65 70 75 

Thr Leu Tyr Asp Phe Gin Tyr Leu Leu Leu Leu lie Val Gly He 

80 85 90 

Phe Ser Leu Ser val Met Glu Ser Pro Gly Pro Leu Ala Lys Thr 

95 100 105 

Ala Ala Phe Thr Leu Leu Leu val Ser Leu Leu Leu Pro lie Thr 

110 115 120 

Arg Gin Phe Phe Leu Pro Phe Leu Pro lie Ala Gly Trp Leu lie 

125 130 135 

Phe Phe Tyr Ala Cys Gin Phe lie Pro Ser Asp Trp Arg Pro Ala 

140 145 150 

lie Trp val Arg val Leu Pro Ala Leu Glu Asn lie Leu Tyr Gly 

155 160 165 

Ala Asn He Ser Asn lie Leu Ser Ala His Gin Asn Val Val Leu 

170 175 180 

Asp val Leu Ala Trp Leu Pro Tyr Gly lie Cys His Tyr Gly Ala 

185 190 195 

Pro Phe Val Cys Ser Ala lie Met Phe lie Phe Gly Pro Pro Gly 

200 205 210 
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Thr Val Pro Leu Phe Ala Arg Thr Phe Gly Tyr lie Ser Met Ala 

215 220 225 

Ala Val Thr lie Gin Leu Phe Phe Pro Cys Ser Pro Pro Trp Tyr 

230 235 240 

Glu Asn Leu Tyr Gly Leu Ala Pro Ala Asp Tyr Ser Met Pro Gly 

245 250 255 

Asn Pro Ala Gly Leu Ala Arg lie Asp Glu Leu Phe Gly lie Asp 

260 265 270 

Leu Tyr Thr Ser Gly Phe Arg Gin Ser Pro Val Val Phe Gly Ala 

275 280 285 

Phe Pro Ser Leu His Ala Ala Asp Ser Thr Leu Ala Ala Leu Phe 

290 295 300 

Met Ser Gin Val Phe Pro Arg Leu Lys Pro Leu Phe val lie Tyr 

305 310 315 

Thr Leu Trp Met Trp Trp Ala Thr Met Tyr Leu Ser His His Tyr 

320 325 330 

Ala val Asp Leu val Gly Gly Gly Leu Leu Ala Thr val Ala Phe 

335 340 345 

Tyr Phe Ala Lys Thr Arg Phe Met Pro Arg val Gin Asn Asp Lys 

350 355 360 

Met Phe Arg Trp Asp Tyr Asp Tyr val Glu Tyr Gly Asp Ser Ala 

365 370 375 

Leu Asp Tyr Gly Tyr Gly Pro Ala Ser Phe Glu Gly Glu Phe Asn 

380 385 390 

Leu Asp Ser Asp Glu Trp Thr Val Gly Ser Ser Ser Ser lie Ser 

395 400 405 

Ser Gly Ser Leu Ser Pro val Asp Asp His Tyr ser Trp Glu Gly 

410 415 420 

Glu Thr Leu Ala Ser Pro Ala Thr Asp lie Glu Ser Gly Arg His 

425 430 435 

Phe 
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INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) molecule type: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 

15 10 15 

Asn Cys His val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 

20 25 30 

Thr Leu Leu Lys val Gin Lys Tyr Lys Pro Ala Leu ser Asp Trp 

35 40 45 

Val His Tyr lie Phe Leu Gly Ser lie Met Leu Phe val Phe lie 

50 55 60 

Thr Asn Pro Ala Pro Trp lie Phe Lys He Leu Phe Tyr Cys Phe 

65 70 75 

Leu Gly Thr Leu Phe lie lie Pro Ala Thr Ser Gin Phe Phe Phe 

80 85 90 

Asn Ala Leu Pro lie Leu Thr Trp val Ala Leu Tyr Phe Thr Ser 

95 100 105 

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro lie Thr val Lys val 

110 115 120 

Leu Pro Ala Val Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asp 

125 130 135 

lie Leu Ala Thr Ser Thr Asn Ser Phe Leu Asp lie Leu Ala Trp 

140 145 150 

Leu Pro Tyr Gly Leu Phe His Phe Gly Ala Pro Phe Val Val Ala 

155 160 165 

Ala lie Leu Phe val Phe Gly Pro Pro Thr val Leu Gin Gly Tyr 
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170 175 180 

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly Val lie Met Gin 

185 190 195 

Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Lys lie Leu Tyr Gly 

200 205 210 
Leu Gin Ser Ala Asn Tyr Asp Met His Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Lys Leu Leu Gly lie Asn Met Tyr Thr Thr Ala 

230 235 240 

Phe Ser Asn ser ser val lie Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe lie Ala Tyr val Cys Trp Leu Trp 

275 280 285 

Trp ser Thr Met Tyr Leu Thr His His Tyr Phe val Asp Leu Met 

290 295 300 

Ala Gly Ser val Leu Ser Tyr val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie Val Asp Thr Ser Leu Phe Cys Arg Trp Ser 

320 325 330 

Tyr Thr Ser He Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu Ser val Pro Leu ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro ser val ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser val Ser Arg ser Ser Ala 

380 385 390 
Thr Ser lie Thr Ser Leu Gly Val Lys Arg Ala 

395 400 
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INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE type: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Leu Ala Ala Cys Asn 
15 10 15 

Arg Ala Ser Gin Tyr Lys Leu Glu Thr Ser Leu Asn Pro Met Pro 

20 25 30 

Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Ser Trp Thr His Leu 

35 40 45 

Gin Tyr Val Phe Leu Ala Gly Asn Leu lie Phe Ala Cys lie val 

50 55 60 

lie Glu Ser Pro Gly Phe Trp Gly Lys Phe Gly lie Ala Cys Leu 

65 70 75 

Leu Ala lie Ala Leu Thr Val Pro Leu Thr Arg Gin lie Phe Phe 

80 85 90 

Pro Ala lie val lie lie Thr Trp Ala lie Leu Phe Tyr Ser Cys 

95 100 105 

Arg Phe lie Pro Glu Arg Trp Arg Pro Pro lie Trp Val Arg val 
110 115 120 

Leu Pro Thr Leu Glu Asn lie Leu Tyr Gly Ser Asn Leu Ser Ser 
125 130 135 

Leu Leu Ser Lys Thr Thr His Ser lie Leu Asp lie Leu Ala Trp 
140 145 150 

val Pro Tyr Gly Val Met His Tyr Ser Ala Pro Phe lie lie Ser 



155 



15 



160 



165 



Phe lie Leu Phe lie Phe Ala Pro Pro Gly Thr Leu Pro val Trp 

170 175 180 

Ala Arg Thr Phe Gly Tyr Met Asn Leu Phe Gly val Leu lie Gin 

185 190 195 

Met Ala Phe Pro Cys Ser Pro Pro Trp Tyr Glu Asn Met Tyr Gly 

200 205 210 

Leu Glu Pro Ala Thr Tyr Ala val Arg Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Ala Leu Phe Gly Thr Ser lie Tyr Thr Asp Gly 

230 235 240 

Phe Ser Asn Ser Pro val val Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

Ala Gly Trp Ala Met Leu Glu Ala Leu Phe Leu Ser His val Phe 

260 265 270 

Pro Arg Tyr Arg Phe Cys Phe Tyr Gly Tyr Val Leu Trp Leu Cys 

275 280 285 

Trp Cys Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu val 

290 295 300 

Gly Gly Met Cys Leu Ala lie lie Cys Phe val Phe Ala Gin Lys 

305 310 315 

Leu Arg Leu Pro Gin Leu Gin Thr Gly Lys lie Leu Arg Trp Glu 

320 325 330 

Tyr Glu Phe Val lie His Gly His Gly Leu Ser Glu Lys Thr Ser 

335 340 345 

Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Leu Leu Gly Arg Asp 

350 355 360 

Ser Phe Thr Gin Asn Pro Asn Ala val Ala Phe Met Ser Gly Leu 

365 370 375 

Asn Asn Met Glu Leu Ala Asn Thr Asp His Glu Trp Ser val Gly 

380 385 390 

Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Ala Ala Asp Leu lie 



395 



400 
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405 



Asp Arg Pro Ala Ser Thr Thr ser Ser lie Phe Asp Ala ser His 
410 415 420 

Leu Pro 



INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met Ala Ser Ser lie Leu Arg Ser Lys lie lie Gin Lys Pro Tyr 

15 10 15 

Gin Leu Phe His Tyr Tyr Phe Leu Ser Glu Lys Ala Pro Gly Ser 

20 25 30 

Thr val Ser Asp Leu Asn Phe Asp Thr Asn lie Gin Thr Ser Leu 

35 40 45 

Arg Lys Leu Lys His His His Trp Thr Val Gly Glu He Phe His 

50 55 60 

Tyr Gly Phe Leu val Ser lie Leu Phe Phe val Phe val Val Phe 

65 70 75 

Pro Ala Ser Phe Phe lie Lys Leu Pro lie lie Leu Ala Phe Ala 

80 85 90 

Thr Cys Phe Leu lie Pro Leu Thr Ser Gin Phe Phe Leu Pro Ala 

95 100 105 

Leu Pro val Phe Thr Trp Leu Ala Leu Tyr Phe Thr Cys Ala Lys 

110 115 120 

lie Pro Gin Glu Trp Lys Pro Ala lie Thr Val Lys val Leu Pro 

125 130 135 

Ala Met Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asn val Leu 
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140 145 150 

Ala Thr lie Thr Thr Gly Val Leu Asp lie Leu Ala Trp Leu Pro 

155 160 165 

Tyr Gly lie lie His Phe Ser Phe Pro Phe val Leu Ala Ala lie 

170 175 180 

lie Phe Leu Phe Gly Pro Pro Thr Ala Leu Arg ser Phe Gly Phe 

185 190 195 

Ala Phe Gly Tyr Met Asn Leu Leu Gly val Leu lie Gin Met Ala 

200 205 210 

Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Leu His Gly Leu Glu 

215 220 225 

Pro Ala Asn Tyr Ser Met His Gly Ser Pro Gly Gly Leu Gly Arg 

230 235 240 

lie Asp Lys Leu Leu Gly val Asp Met Tyr Thr Thr Gly Phe Ser 

245 250 255 

Asn ser ser lie lie Phe Gly Ala Phe Pro ser Leu His Ser Gly 

260 265 270 

cys Cys lie Met Glu val Leu Phe Leu Cys Trp Leu Phe Pro Arg 

275 280 285 

Phe Lys Phe val Trp val Thr Tyr Ala Ser Trp Leu Trp Trp Ser 

290 295 300 

Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu lie Gly Gly 

305 310 315 

Ala Met Leu Ser Leu Thr val Phe Glu Phe Thr Lys Tyr Lys Tyr 

320 325 330 

Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Arg Trp Ser Tyr Thr 

335 340 345 

Glu lie Glu Lys lie Asp lie Gin Glu lie Asp Pro Leu Ser Tyr 

350 355 360 

Asn Tyr He Pro val Asn Ser Asn Asp Asn Glu Ser Arg Leu Tyr 

365 370 t375 

Thr Arg val Tyr Gin Glu Ser Gin Val Ser Pro Pro Gin Arg Ala 
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380 385 390 

Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Phe Ser Arg Ser Arg 

395 400 405 

Gin Ser Ser Lys Thr Gin Val Pro Leu Ser Asn Leu Thr Asn Asn 

410 415 420 

Asp Gin val Ser Gly lie Asn Glu Glu Asp Glu Glu Glu Glu Gly 

425 430 435 

Asp Glu lie Ser Ser Ser Thr Pro Ser Val Phe Glu Asp Glu Pro 

440 445 450 

Gin Gly Ser Thr Tyr Ala Ala Ser Ser Ala Thr Ser Val Asp Asp 

455 460 465 

Leu Asp Ser Lys Arg Asn 

470 

INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) molecule TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: /note= "xaa at position 3 is Val or il 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER information: /note= "xaa at position 7 is Leu or val 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Leu Asp Xaa Leu Ala Trp Xaa Pro Tyr Gly 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Phe Gly Ala Phe Pro Ser Leu His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE type: peptide 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: /note= "Xaa at position 5 is Ser or Thr M 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 
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(D) OTHER information: /note= "Xaa at position 9 is Ala or Phe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Thr Met Tyr Leu xaa His His Tyr xaa val Asp Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



attttcttcc 


ccataacaac 


tcttctcgcc 


cttcctccgg 


ctccgtggcc 


aaattgtttt 


60 


atgcagcgcc 


tcctagcgat 


ttaacctcgt 


tctcgttgcc 


cttgcctgtc 


cgccttgcgt 


120 


cagtacgacc 


cttgcaacgt 


gaccttcccc 


agagtatcct 


cgtttggccg 


ctggagaccg 


180 


gagcttgcac 


cctcataaac 


tagctcttcg 


aaatcaattc 


tccgttctcc 


agagattatc 


240 


ggatcgaatc 


tctccgctgt 


cgacaccttt 


cgtctctcgg 


tgatcctcgc 


ccttggagtc 


300 


tcgtcacgtt 


gacgccttga 


acccctggcc 


gccaactcca 


cataggagac 


cacacttcat 


360 


tcttcccccg 


ccataattgc 


agcaccctcc 


gtctcccttc 


gagctcctcc 


tggatcatca 


420 


agtccgaaag 


gattagactc 


gtcgcagcga 


tgaataccac 


ccttccatcc 


tggaaggatc 


480 


ggacgcaaaa 


ccagttcggc 


aagctccaga 


tccaagtccc 


atggcgcacc 


atacaacttc 


540 


tcgtgccgca 


ccgtatgcga 


cggaagattc 


ggtccaagct 


gcgcagtcgg 


atctcgccta 


600 


cctcatcgat 


atcctcgttg 


cagacgtcat 


tctcacctgt 


cgatacactc 


aggtcgctgc 


660 


aaagtcatag 


atggacgctc 


tatgactttc 


agtatctttt 


gctgctgatt 


gtcggcatat 


720 


tctcgctgag 


cgttatggaa 


tcacctggac 


cattggcaaa 


gaccgccgcg 


tttacgctac 


780 


ttctcgtctc 


tctccttctc 


ccgattacgc 


gccagttctt 


cttgccattc 


ctcccgattg 


840 


caggatggct 


tatatttttc 


tacgcttgcc 


agttcatccc 


gagcgactgg 


cgccctgcaa 


900 


tctgggttcg 


cgtgctgccg 


gctctggaaa 


acattctcta 


cggtgctaat 


atcagtaaca 


960 


tcctttccgc 


tcaccaaaat 


gtggtgcttg 


acgttttggc 
21 


gtggcttccc 


tacggaatct 


1020 



gccattatgg 


cgcgccattt 


gtgtgctcag 


cgatcatgtt 


catctttggt 


cctcccggca 


1080 


ccgtccccct 


tttcgctcga 


acttttggat 


acatcagcat 


ggctgcagtc 


accattcagc 


1140 


tgtttttccc 


ctgctctcct 


ccgtggtacg 


aaaatctgta 


tggtttggct 


ccggctgatt 


1200 


actccatgcc 


gggtaatcct 


gcgggccttg 


ctcgcatcga 


tgagcttttt 


gggatagact 


1260 


tgtacacatc 


gggcttcaga 


caatctcccg 


tcgtgtttgg 


cgcatttcct 


tccctacatg 


1320 


ccgctgattc 


gacacttgca 


gctctattta 


tgagccaagt 


gttcccacgg 


ttgaagccct 


1380 


tgtttgtcat 


ctatactctc 


tggatgtggt 


gggctacaat 


gtatctttcg 


caccactacg 


1440 


ctgttgatct 


ggtcggtggt 


ggcctcttgg 


caactgtcgc 


gttctacttt 


gctaaaacgc 


1500 


ggttcatgcc 


tcgcgtccag 


aatgataaga 


tgttccgctg 


ggactacgat 


tatgttgagt 


1560 


acggcgattc 


cgcactcgac 


tatgggtacg 


gtccagccag 


cttcgaaggc 


gaattcaacc 


1620 


ttgatagcga 


tgagtggacc 


gttggttctt 


cgtcatccat 


ttcgtccggc 


tccctcagtc 


1680 


cagttgacga 


ccactactct 


tgggagggcg 


agactcttgc 


ctctcctgcc 


accgacatcg 


1740 


agtctggaag 


gcatttctga 


tcctgctcaa 


tgagccttga 


tacgtactac 


actgtgtacg 


1800 


tgctactgca 


ttgactaatg 


agacggcgtt 


ttcaaacaaa 


ttttaacgac 


atgcttggtt 


1860 


atcgcattga 


gctgatttcg 


acacatatat 


atgtttaata 


cgttttgggg 


acactccagg 


1920 


gattcatgac 


ggttgcttca 


tatcccgacc 


tggggatgga 


ttgacctggt 


tgtgcccaat 


1980 


tttcttctgc 


ctaacgtttt 


gattatacat 


gcatttttca 


cgaaaccagc 


cggcccgcca 


2040 


tgatcgtgac 


ctcaatttga 


gctcgaatct 


tcctggggcc 


tccagcgata 


attcttaatg 


2100 


ctcgttccga 


gggtgccaca 


tcggacattc 


gcttgtacaa 


cttttgcaga 


acgaacattt 


2160 


tcaccgattc 


caacttgagt 


cattcgctta 


ctactttcaa 


ctggtcgaga 


aacttcgctt 


2220 


cttttcagct 


cggctaggtg 


cataaatatt 


ttacattcgt 


gtcgatcgct 


cacatttcac 


2280 


ggcgcctgga 


aacttggggg 


tttcgatttc 


attggaaagg 


ataaacaaca 


tgggctgggc 


2340 


gccttttaca 


cgcactacat 


tcgcttagaa 


aagttctgat 


gcttttaatg 


attcttgcat 


2400 


tggcatatag 


aaaggggtcc 


tccagactcg 


ctactgtggt 


cctctctcaa 


ccccacactc 


2460 


gcttgctttt 


aacagtggac 


accccgtgga 


gctacgtctc 


catcaaatat 


ttggcatcaa 


2520 


ccggaatcga 


tgccaggagg 


actgagctta 


ctcacggtga 


ccgtcgggta 


aaaggcgttc 


2580 


atacagaaca 


tctcctctat 


cctcctgtcc 


catctcgatt 


ctctggctgc 


tggacgcaac 


2640 


acacctcgct 


gcacgttttc 


gacttcctaa 


tacgacctaa 


tatcattctc 


ggttttcttt 


2700 


gctctggctc 


gcggccgcca 


tttatatggc 


gtgtcggctc 


gagtctgaag 


tgaacttttt 


2760 


tctcctttct 


ggcctccaca 


actttccgat 


ccctagcagc 


ttcccgtgca 


cagcgaggtg 


2820 


ttgttggatg 


attgttccat 


agcattatca 


ttattcctaa 


tccggtagcg 


ttatgattta 


2880 


tgaagaacag 


tgatgtacat 


tattatgcgg 


tgatcaaaaa 
22 


aaaaaaaaaa 


aaaaa 


2935 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2856 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) molecule type: Genomic dna 
(iv) anti-sense: Yes 

(xi) sequence description: seq id no: 13: 



tagtatacaa 


gttgccttca 


taaatcataa 


cactaccgga 


ttataactaa 


taatactgct 


60 


gctatggata 


atttgcccca 


accacacaca 


cacacacact 


atccaaggga 


aacgaaaaga 


120 


agaggatagt 


gtccatgagc 


ggaatttgtg 


aggtgaggag 


gatgtgatag 


gtaggtgtga 


180 


gagcgtcaga 


ccatagcaag 


caagtcaagt 


caccgtccaa 


caagtcgaga 


atctgaggtt 


240 


agtcgaagcc 


gggtcaaatc 


gggacgcgtg 


aacatccttc 


aacaagcccc 


gtcagcgtca 


300 


gactctcctc 


ccaaccttgc 


acgtccggcg 


aatgtcagaa 


agcaacacag 


tagccatgtg 


360 


tatagccccc 


gggtaatatg 


tggacgtaat 


ccactttgag 


cgaatggata 


aaccatgcga 


420 


gcaatcaaga 


agaaggtcca 


tccttcccct 


ctttctatat 


gctaatgcag 


agttcgtgat 


480 


gcattgcaaa 


gcagaaatgt 


aaaaccgtct 


atgcacagcg 


agtgaagtgc 


aagaagaaaa 


540 


ggcgccccat 


tccgtgattt 


tttccaatat 


aaatgcttcc 


agtcccaagc 


gccaagagta 


600 


tgagcgatcg 


acacgaatgg 


taaaaataaa 


tatatagcat 


agtcgaaacg 


aataataagg 


660 


atgggttcga 


ccaggtcaag 


gtaatagcgg 


acgatagaag 


tgagaattga 


agtagtgtct 


720 


gggtggacac 


gagacaagcg 


agtgttaccg 


acgggggtta 


tcggatggtc 


ttccaggact 


780 


aaacaccagg 


atgcgtagcc 


ccattgtgtc 


cattaacatt 


atgatggagc 


aacggcgtgt 


840 


agcgatgagc 


gataacccat 


cccacgttcg 


agcccaaccc 


ggtttcgcaa 


tgtcaagcaa 


900 


tgtacagtga 


agtgcggcgg 


aaagagaaac 


agtatcccag 


gcggattcgg 


accgaggtac 


960 


attaaggtcg 


aaccgatgat 


ctgggacacc 


gaagaaaggg 


tattaaagat 


atatgtgtcg 


1020 


aaatgtgctc 


gatgcaataa 


agccagaatg 


tcgtgaaatg 


ttctaacgcc 


gtctcgtaaa 


1080 


tagatgcagt 


ggaggaggag 


cgagatcgct 


atatgcaggt 


atcgagtttg 


gtttgtggct 


1140 


actcaagggc 


tgaaagtatg 


cctgccggac 


tcaatatcag 


tatgtgggga 


ggtcagtgcc 


1200 


tcggtttccc 


atgagtaatg 


atcgtcaacg 


ggactcaagg 
23 


agcctgagga 


gacggaggat 


1260 



gaagaaccaa 


cagtccattc 


atcgctgtcg 


agattgaagt 


ctccatcata 


gccagctgca 


1320 


ccatacccat 


actccagggc 


agactcgccg 


aattccacat 


agtcgtagtc 


ccaacggaag 


1380 


gtcttgtcga 


gctggacacg 


gggaaggaat 


cgggtcttgg 


cgaagtagaa agcaatggcg 


1440 


gccaggagac 


cacccgcaac 


caaatcgacc 


gcatagtgat 


gtgagaggta 


cattgttgcc 


1500 


caccacatcc 


atagagtata 


ggtcacgaag 


acgggcttca 


tgcgggggaa aacatgactc 


1560 


atgaaaagtg 


cggccagggt 


tgagtcggca 


gcatgcagcg 


acggaaaagc 


gccgaacaca 


1620 


acaggcgact 


gatggaaaac 


agacgtgtaa 


aggtcgatgc 


cgaaaagctt 


gtcaatgcgg 


1680 


gcaagccctg 


cgggatcacc 


ttggatggag 


tagtctgccg 


gagctagacc 


atagcgattc 


1740 


tcataccaag 


gtggagagca 


agggaaaaac 


agctgaatag 


taaccgcagt 


catactgata 


1800 


tagccgaaag 


tgcgcgcgaa 


aaggggaaca 


gtgccgggcg 


gaccgaagat 


gaacatgatc 


1860 


aacgagcaca 


caaacggagc 


gccatagtgg 


cagataccgt 


agggtagcca 


cgccagcacg 


1920 


tcaagcacaa 


cgttctggtg 


agcggatagg 


atgttgctga 


tgtttgcgcc 


gtagagaata 


1980 


ttctccagtg 


caggcaagac 


acgaacccaa 


atcgcagggc 


gccaatcgct 


tgggatgaac 


2040 


tggcaggcgt 


agaaaaacag 


aagccatccg 


gcaatcggca 


gaaacgggag 


gaagaactgg 


2100 


cgggtcatag 


ggatcaggag 


agagaatagg 


agcatggaga 


aaatggccgt 


tttgcccaaa 


2160 


ggcccgggcg 


actcgataac 


ggtcaaagag 


aagatgccca 


cgatcaacag 


aagcagatat 


2220 


tggaagtcgt 


aaaccgtcca 


tcggtggctt 


tggagcgatc 


gtagtgtgtc 


tgcaggcgat 


2280 


aacgacgtct 


gtaaagaggc 


tattgacgag 


gtaggagacg 


ctctactgcg 


caatttggac 


2340 


cttaacttcc 


gccgcatgcg 


atgcgggacg 


agcagttgga 


tggaccgcca 


tggaacctgg 


2400 


atctgaagct 


ttccaaactg 


gttctgcgtg 


cggtccttcc 


acgtgggaag 


tgtttgattc 


2460 


atggttgcgg 


cgagtcttgc 


gggcaagact 


cagctataat 


tcctccagat 


cgaactccgt 


2520 


ctgctttctt 


tccccccgat 


tgtacaagag 


ttggaaggcg 


acgaataaat 


cggtcgcgga 


2580 


agcggcagaa 


aaggggtctt 


gaataacaag 


ccgttaggca 


acgagcaggg 


tgctgacggc 


2640 


gtcaacccga 


gcggggaaat 


tgagatcggt 


tcgaagaaga 


cgaaacgaag 


cgcacgctcc 


2700 


ggtctccagc 


ggccaaacga 


ggatgtcttt 


gagccgtcac 


gttgcagggg 


gtttcactga 


2760 


tgcaaggcga 


agtgcaaggg 


cgcagcagtc 


gagaggaaaa 


cctggaacga 


ggaggtcgtg 


2820 


agggaggcag 


atggccacgg 


agccggagta 


taaacc 






2856 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2856 bases 

(B) type: nucleic acid 



(C) strandedness: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 
(iv) ANTI-SENSE: Yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



uaguauacaa 


guugccuuca 


uaaaucauaa 


cacuaccgga 


uuauaacuaa 


uaauacugcu 


60 


gcuauggaua 


auuugcccca 


accacacaca 


cacacacacu 


auccaaggga 


aacgaaaaga 


120 


agaggauagu 


guccaugagc 


ggaauuugug 


aggugaggag 


gaugugauag 


guagguguga 


180 


gagcgucaga 


ccauagcaag 


caagucaagu 


caccguccaa 


caagucgaga 


aucugagguu 


240 


agucgaagcc 


gggucaaauc 


gggacgcgug 


aacauccuuc 


aacaagcccc 


gucagcguca 


300 


gacucuccuc 


ccaaccuugc 


acguccggcg 


aaugucagaa 


agcaacacag 


uagccaugug 


360 


uauagccccc 


ggguaauaug 


uggacguaau 


ccacuuugag 


cgaauggaua 


aaccaugcga 


420 


gcaaucaaga 


agaaggucca 


uccuuccccu 


cuuucuauau 


gcuaaugcag 


aguucgugau 


480 


gcauugcaaa 


gcagaaaugu 


aaaaccgucu 


augcacagcg 


agugaagugc 


aagaagaaaa 


540 


ggcgccccau 


uccgugauuu 


uuuccaauau 


aaaugcuucc 


agucccaagc 


gccaagagua 


600 


ugagcgaucg 


acacgaaugg 


uaaaaauaaa 


uauauagcau 


agucgaaacg 


aauaauaagg 


660 


auggguucga 


ccaggucaag 


guaauagcgg 


acgauagaag 


ugagaauuga 


aguagugucu 


720 


ggguggacac 


gagacaagcg 


aguguuaccg 


acggggguua 


ucggaugguc 


uuccaggacu 


780 


aaacaccagg 


augcguagcc 


ccauuguguc 


cauuaacauu 


augauggagc 


aacggcgugu 


840 


agcgaugagc 


gauaacccau 


cccacguucg 


agcccaaccc 


gguuucgcaa 


ugucaagcaa 


900 


uguacaguga 


agugcggcgg 


aaagagaaac 


aguaucccag 


gcggauucgg 


accgagguac 


960 


auuaaggucg 


aaccgaugau 


cugggacacc 


gaagaaaggg 


uauuaaagau 


auaugugucg 


1020 


aaaugugcuc 


gaugcaauaa 


agccagaaug 


ucgugaaaug 


uucuaacgcc 


gucucguaaa 


1080 


uagaugcagu 


ggaggaggag 


cgagaucgcu 


auaugcaggu 


aucgaguuug 


guuuguggcu 


1140 


acucaagggc 


ugaaaguaug 


ccugccggac 


ucaauaucag 


uaugugggga 


ggucagugcc 


1200 


ucgguuuccc 


augaguaaug 


aucgucaacg 


ggacucaagg 


agccugagga 


gacggaggau 


1260 


gaagaaccaa 


caguccauuc 


aucgcugucg 


agauugaagu 


cuccaucaua 


gccagcugca 


1320 


ccauacccau 


acuccagggc 


agacucgccg 


aauuccacau 


agucguaguc 


ccaacggaag 


1380 


gucuugucga 


gcuggacacg 


gggaaggaau 


cgggucuugg 


cgaaguagaa 


agcaauggcg 


1440 


gccaggagac 


cacccgcaac 


caaaucgacc 


gcauagugau 


gugagaggua 


cauuguugcc 


1500 


caccacaucc 


auagaguaua 


ggucacgaag 


acgggcuuca 
25 


ugcgggggaa 


aacaugacuc 


1560 




augaaaagug 


cggccagggu 


ugagucggca 


gcaugcagcg 


acggaaaagc 


gccgaacaca 


1620 


acaggcgacu 


gauggaaaac 


agacguguaa 


aggucgaugc 


cgaaaagcuu 


gucaaugcgg 


1680 


gcaagcccug 


cgggaucacc 


uuggauggag 


uagucugccg 


gagcuagacc 


auagcgauuc 


1740 


ucauaccaag 


guggagagca 


agggaaaaac 


agcugaauag 


uaaccgcagu 


cauacugaua 


1800 


uagccgaaag 


ugcgcgcgaa 


aaggggaaca 


gugccgggcg 


gaccgaagau 


gaacaugauc 


1860 


aacgagcaca 


caaacggagc 


gccauagugg 


cagauaccgu 


aggguagcca 


cgccagcacg 


1920 


ucaagcacaa 


cguucuggug 


agcggauagg 


auguugcuga 


uguuugcgcc 


guagagaaua 


1980 


uucuccagug 


caggcaagac 


acgaacccaa 


aucgcagggc 


gccaaucgcu 


ugggaugaac 


2040 


uggcaggcgu 


agaaaaacag 


aagccauccg 


gcaaucggca 


gaaacgggag 


gaagaacugg 


2100 


cgggucauag 


ggaucaggag 


agagaauagg 


agcauggaga 


aaauggccgu 


uuugcccaaa 


2160 


ggcccgggcg 


acucgauaac 


ggucaaagag 


aagaugccca 


cgaucaacag 


aagcagauau 


2220 


uggaagucgu 


aaaccgucca 


ucgguggcuu 


uggagcgauc 


guaguguguc 


ugcaggcgau 


2280 


aacgacgucu 


guaaagaggc 


uauugacgag 


guaggagacg 


cucuacugcg 


caauuuggac 


2340 


cuuaacuucc 


gccgcaugcg 


augcgggacg 


agcaguugga 


uggaccgcca 


uggaaccugg 


2400 


aucugaagcu 


uuccaaacug 


guucugcgug 


cgguccuucc 


acgugggaag 


uguuugauuc 


2460 


augguugcgg 


cgagucuugc 


gggcaagacu 


cagcuauaau 


uccuccagau 


cgaacuccgu 


2520 


cugcuuucuu 


uccccccgau 


uguacaagag 


uuggaaggcg 


acgaauaaau 


cggucgcgga 


2580 


agcggcagaa 


aaggggucuu 


gaauaacaag 


ccguuaggca 


acgagcaggg 


ugcugacggc 


2640 


gucaacccga 


gcggggaaau 


ugagaucggu 


ucgaagaaga 


cgaaacgaag 


cgcacgcucc 


2700 


ggucuccagc 


ggccaaacga 


ggaugucuuu 


gagccgucac 


guugcagggg 


guuucacuga 


2760 


ugcaaggcga 


agugcaaggg 


cgcagcaguc 


gagaggaaaa 


ccuggaacga 


ggaggucgug 


2820 


agggaggcag 


auggccacgg 


agccggagua 


uaaacc 






2856 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2385 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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aagctttttt 


gcctctgcaa 


aagttccttt 


ctcgaattgg 


ttttttgagg 


aaaagcaagt 


60 


taataaacta 


attatattat 


atataattag 


caattttata 


aaaaaaataa 


aaaaatagcc 


120 


ctgattgctg 


gcaactgtga 


gctgaacatt 


ggttaatcgg 


tccatctttt 


tttaaatatt 


180 


ttacatcgct 


acttttaagt 


gcttgacact 


tgcatttaat 


agctactttc 


tttccttcat 


240 


aaaaattcct 


tttttttcct 


ttagttttcc 


ggttaattcc 


ttacgaaatt 


tttttcgtac 


300 


gcttcccttt 


tttactctga 


taattctttg 


aagcaatgtc 


tgctctttcg 


accttaaaaa 


360 


agcgccttgc 


tgcgtgtaac 


cgagcatccc 


aatacaagtt 


ggaaacaagc 


ttaaacccta 


420 


tgcctacatt 


tcgtttgcta 


cgcaatacga 


aatggtcatg 


gacacatttg 


caatatgtgt 


480 


ttctagcagg 


taatttgatt 


tttgcttgta 


ttgtcattga 


atctcctgga 


ttctggggga 


540 


aatttggcat 


tgcctgtctt 


ttggccattg 


cgttgaccgt 


tcctttaaca 


cgccaaattt 


600 


tttttcctgc 


cattgttatc 


atcacctggg 


caattttatt 


ttactcttgt 


aggtttattc 


660 


cagaacgctg 


gcgtccaccc 


atatgggttc 


gtgttttacc 


cacacttgaa 


aatattcttt 


720 


atggctctaa 


tctttctagt 


cttctctcga 


aaaccacgca 


tagcatcctt 


gatattttgg 


780 


cctgggttcc 


atatggagtc 


atgcattatt 


cggctccttt 


tatcatttca 


tttattcttt 


840 


tcatctttgc 


acctcctgga 


actcttccag 


tttgggctcg 


aacttttggt 


tatatgaatt 


900 


tatttggtgt 


tcttatccaa 


atggctttcc 


cctgttctcc 


tccttggtat 


gaaaatatgt 


960 


atggtttaga 


acctgccacg 


tatgcagtac 


gtggctctcc 


tggtggattg 


gcccgtattg 


1020 


atgctctctt 


cggcactagc 


atttacactg 


attgtttttc 


taactctccg 


gttgtttttg 


1080 


gtgcctttcc 


atctcttcac 


gctggatggg 


ccatgctgga 


agcacttttc 


ctttcgcatg 


1140 


tgtttcctcg 


ataccgcttc 


tgcttttatg 


gatatgttct 


atggctttgc 


tggtgtacta 


1200 


tgtaccttac 


ccaccactac 


tttgtagatt 


tggtcggcgg 


tatgtgttta 


gctattatat 


1260 


gcttcgtttt 


tgctcaaaag 


ctacgcctcc 


cacagttgca 


aactggtaaa 


atccttcgtt 


1320 


gggaatacga 


gtttgttatc 


cacggtcatg 


gtctttccga 


aaaaaccagc 


aactccttgg 


1380 


ctcgtaccgg 


cagcccatac 


ttacttggaa 


gggattcttt 


tactcaaaac 


cctaatgcag 


1440 


tagccttcat 


gagtggtctt 


aacaatatgg 


aacttgctaa 


caccgatcat 


gaatggtccg 


1500 


tgggttcatc 


atcacctgag 


ccgttaccta 


gtcctgctgc 


tgatttgatt 


gatcgtcctg 


1560 


ccagtaccac 


ttcctccatc 


tttgatgcaa 


gtcatcttcc 


ttaaatcaac 


gtgctttaag 


1620 


aatatatttc 


caaaagctac 


atgatacatt 


gactagaatc 


ggtttgattc 


atagtggtat 


1680 


tggaatgatg 


ttgttcattg 


tgttttttaa 


ctgttaatct 


gacatccatt 


gagtcattct 


1740 


ttacaatttg 


taaaattaat 


ttgtatcact 


aattttgaag 


gaagctattt 


tggtattaat 


1800 


accgcttttg 


gtctccactt 


ccttttcgaa 


actcttaaca 


gcgattaggc 


cgggtatctt 


1860 


ccagtgtgat 


gtataggtat 


ttgtcgtttt 


tttatcattt 


ccgttaataa 


agaactcttt 


1920 
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tatccagctt cttacactgt caactgttgt gaaaggaaca catttagaat ttcattttcc 1980 

ttatttgttg tgatttaaat cgtttgacat aattttaaat ttggtttgaa atgtgtgtga 2040 

gaaggcttgt tttattcatt tagtttattg cttgtttgca cgaaaatcca gaacggagca 2100 

ttaatgtaat ccttttttat tctgtaaagc gtttttatac aaatgttggt tatacgtttc 2160 

taaaataaga atattgttat aataatatag ttttttctat catttgttac acacactaaa 2220 

gagacattaa ggataagcaa atgtgttaaa atgataatat attttggaaa catttataaa 2280 

gaaattaagc agctttgact aactacattt ttgttttttt cctaagcaaa actgtatagt 2340 

tatacacgcg agctgtattc acttccattg tagtgacttg agctc 2385 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Leu Ala Ala Cys Asn 
15 10 15 

Arg Ala Ser Gin Tyr Lys Leu Glu Thr Ser Leu Asn Pro Met Pro 

20 25 30 

Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Ser Trp Thr His Leu 

35 40 45 

Gin Tyr val Phe Leu Ala Gly Asn Leu lie Phe Ala Cys lie val 

50 55 60 

lie Glu ser Pro Gly Phe Trp Gly Lys Phe Gly He Ala Cys Leu 

65 70 75 

Leu Ala lie Ala Leu Thr val Pro Leu Thr Arg Gin lie Phe Phe 

80 85 90 

Pro Ala lie val lie lie Thr Trp Ala lie Leu Phe Tyr Ser Cys 



95 
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100 



105 



Arg Phe lie Pro Glu Arg Trp Arg Pro Pro lie Trp Val Arg Val 

110 115 120 

Leu Pro Thr Leu Glu Asn lie Leu Tyr Gly Ser Asn Leu Ser ser 

125 130 135 

Leu Leu Ser Lys Thr Thr His Ser lie Leu Asp lie Leu Ala Trp 

140 145 150 

val Pro Tyr Gly val Met His Tyr Ser Ala Pro Phe lie lie ser 

155 160 165 

Phe lie Leu Phe lie Phe Ala Pro Pro Gly Thr Leu Pro val Trp 

170 175 180 

Ala Arg Thr Phe Gly Tyr Met Asn Leu Phe Gly val Leu lie Gin 

185 190 195 

Met Ala Phe Pro Cys Ser Pro Pro Trp Tyr Glu Asn Met Tyr Gly 

200 205 210 

Leu Glu Pro Ala Thr Tyr Ala Val Arg Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Ala Leu Phe Gly Thr ser lie Tyr Thr Asp Cys 

230 235 240 

Phe Ser Asn ser Pro val val Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

Ala Gly Trp Ala Met Leu Glu Ala Leu Phe Leu Ser His val Phe 

260 265 270 

Pro Arg Tyr Arg Phe Cys Phe Tyr Gly Tyr val Leu Trp Leu Cys 

275 280 285 

Trp Cys Thr Met Tyr Leu Thr His His Tyr Phe val Asp Leu val 

290 295 300 

Gly Gly Met Cys Leu Ala lie lie Cys Phe val Phe Ala Gin Lys 

305 310 315 

Leu Arg Leu Pro Gin Leu Gin Thr Gly Lys lie Leu Arg Trp Glu 

320 325 330 

Tyr Glu Phe Val lie His Gly His Gly Leu Ser Glu Lys Thr Ser 



335 



340 
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345 



Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Leu Leu Gly Arg Asp 

350 355 360 

Ser Phe Thr Gin Asn Pro Asn Ala Val Ala Phe Met Ser Gly Leu 

365 370 375 

Asn Asn Met Glu Leu Ala Asn Thr Asp His Glu Trp Ser val Gly 

380 385 390 

Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Ala Ala Asp Leu lie 

395 400 405 

Asp Arg Pro Ala Ser Thr Thr Ser Ser lie Phe Asp Ala Ser His 

410 415 420 

Leu Pro 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



aagctttttt 


gcctctgcaa 


aagttccttt 


ctcgaattgg 


ttttttgagg 


aaaagcaagt 


60 


taataaacta 


attatattat 


atataattag 


caattttata 


aaaaaaataa 


aaaaatagcc 


120 


ctgattgctg 


gcaactgtga 


gctgaacatt 


ggttaatcgg 


tccatctttt 


tttaaatatt 


180 


ttacatcgct 


acttttaagt 


gcttgacact 


tgcatttaat 


agctactttc 


tttccttcat 


240 


aaaaattcct 


tttttttcct 


ttagttttcc ggttaattcc 


ttacgaaatt 


tttttcgtac 


300 


gcttcccttt 


tttactctga 


taattctttg 


aagcaatgtc 


tgctctttcg 


accttaaaaa 


360 


agcgccttgc 


tgcgtgtaac 


cgagcatccc 


aatacaagtt 


ggaaacaagc 


ttaaacccta 


420 


tgcctacatt 


tcgtttgcta 


cgcaatacga 


aatggtcatg 


gacacatttg 


caatatgtgt 


480 


ttctagcagg 


taatttgatt 


tttgcttgta 


ttgtcattga 


atctcctgga 


ttctggggga 


540 


aatttggcat 


tgcctgtctt 


ttggccattg 


cgttgaccgt 


tcctttaaca 


cgccaaattt 


600 


tttttcctgc 


cattgttatc 


atcacctggg 


caattttatt 
30 


ttactcttgt 


aggtttattc 


660 



cagaacgctg 


gcgtccaccc 


atatgggttc 


gtgttttacc 


cacacttgaa 


aatattcttt 


720 


atggctctaa 


tctttctagt 


cttctctcga 


aaaccacgca 


tagcatcctt 


gatattttgg 


780 


cctgggttcc 


atatggagtc 


atgcattatt 


cggctccttt 


tatcatttca 


tttattcttt 


840 


tcatctttgc 


acctcctgga 


actcttccag 


tttgggctcg 


aacttttggt 


tatatgaatt 


900 


tatttggtgt 


tcttatccaa 


atggctttcc 


cctgttctcc 


tccttggtat 


gaaaatatgt 


960 


atggtttaga 


acctgccacg 


tatgcagtac 


gtggctctcc 


tggtggattg 


gcccgtattg 


1020 


atgctctctt 


cggcactagc 


atttacactg 


atggtttttc 


taactctccg 


gttgtttttg 


1080 


gtgcctttcc 


atctcttcac 


gctggatggg 


ccatgctgga 


agcacttttc 


ctttcgcatg 


1140 


tgtttcctcg 


ataccgcttc 


tgcttttatg 


gatatgttct 


atggctttgc 


tggtgtacta 


1200 


tgtaccttac 


ccaccactac 


tttgtagatt 


tggtcggcgg 


tatgtgttta 


gctattatat 


1260 


gcttcgtttt 


tgctcaaaag 


ctacgcctcc 


cacagttgca 


aactggtaaa 


atccttcgtt 


1320 


gggaatacga 


gtttgttatc 


cacggtcatg 


gtctttccga 


aaaaaccagc 


aactccttgg 


1380 


ctcgtaccgg 


cagcccatac 


ttacttggaa 


gggattcttt 


tactcaaaac 


cctaatgcag 


1440 


tagccttcat 


gagtggtctt 


aacaatatgg 


aacttgctaa 


caccgatcat 


gaatggtccg 


1500 


tgggttcatc 


atcacctgag 


ccgttaccta 


gtcctgctgc 


tgatttgatt 


gatcgtcctg 


1560 


ccagtaccac 


ttcctccatc 


tttgatgcaa 


gtcatcttcc 


ttaaatcaac 


gtgctttaag 


1620 


aatatatttc 


caaaagctac 


atgatacatt 


gactagaatc 


ggtttgattc 


atagtggtat 


1680 


tggaatgatg 


ttgttcattg 


tgttttttaa 


ctgttaatct 


gacatccatt 


gagtcattct 


1740 


ttacaatttg 


taaaattaat 


ttgtatcact 


aattttgaag 


gaagctattt 


tggtattaat 


1800 


accgcttttg 


gtctccactt 


ccttttcgaa 


actcttaaca 


gcgattaggc 


cgggtatctt 


1860 


ccagtgtgat 


gtataggtat 


ttgtcgtttt 


tttatcattt 


ccgttaataa 


agaactcttt 


1920 


tatccagctt 


cttacactgt 


caactgttgt 


gaaaggaaca 


catttagaat 


ttcattttcc 


1980 


ttatttgttg 


tgatttaaat 


cgtttgacat 


aattttaaat 


ttggtttgaa 


atgtgtgtga 


2040 


gaaggcttgt 


tttattcatt 


tagtttattg 


cttgtttgca 


cgaaaatcca 


gaacggagca 


2100 


ttaatgtaat 


ccttttttat 


tctgtaaagc 


gtttttatac 


aaatgttggt 


tatacgtttc 


2160 


taaaataaga 


atattgttat 


aataatatag 


ttttttctat 


catttgttac 


acacactaaa 


2220 


gagacattaa 


ggataagcaa 


atgtgttaaa 


atgataatat 


attttggaaa 


catttataaa 


2280 


gaaattaagc 


agctttgact 


aactacattt 


ttgttttttt 


cctaagcaaa 


actgtatagt 


2340 


tatacacgcg 


agctgtattc 


acttccattg 


tagtgacttg 


agctc 




2385 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) strandedness : si ngl e 

(D) TOPOLOGY: linear 
(ii) molecule type: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Leu Ala Ala Cys Asn 
15 10 15 

Arg Ala Ser Gin Tyr Lys Leu Glu Thr Ser Leu Asn Pro Met Pro 
20 25 30 

Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Ser Trp Thr His Leu 
35 40 45 

Gin Tyr val Phe Leu Ala Gly Asn Leu lie Phe Ala Cys lie val 
50 55 60 

lie Glu Ser Pro Gly Phe Trp Gly Lys Phe Gly lie Ala Cys Leu 
65 70 75 

Leu Ala lie Ala Leu Thr Val Pro Leu Thr Arg Gin lie Phe Phe 
80 85 90 

Pro Ala lie Val lie lie Thr Trp Ala lie Leu Phe Tyr Ser Cys 
95 100 105 

Arg Phe He Pro Glu Arg Trp Arg Pro Pro lie Trp val Arg Val 
110 115 120 

Leu Pro Thr Leu Glu Asn lie Leu Tyr Gly Ser Asn Leu Ser Ser 
125 130 135 

Leu Leu Ser Lys Thr Thr His Ser lie Leu Asp lie Leu Ala Trp 
140 145 150 

val Pro Tyr Gly Val Met His Tyr Ser Ala Pro Phe lie lie Ser 
155 160 165 

Phe He Leu Phe lie Phe Ala Pro Pro Gly Thr Leu Pro Val Trp 
170 175 180 

Ala Arg Thr Phe Gly Tyr Met Asn Leu Phe Gly val Leu lie Gin 
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185 190 195 

Met Ala Phe Pro Cys Ser Pro Pro Trp Tyr Glu Asn Met Tyr Gly 

200 205 210 

Leu Glu Pro Ala Thr Tyr Ala Val Arg Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Ala Leu Phe Gly Thr Ser lie Tyr Thr Asp Gly 

230 235 240 

Phe Ser Asn ser Pro val val Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

Ala Gly Trp Ala Met Leu Glu Ala Leu Phe Leu Ser His val Phe 

260 265 270 

Pro Arg Tyr Arg Phe Cys Phe Tyr Gly Tyr Val Leu Trp Leu Cys 

275 280 285 

Trp cys Thr Met Tyr Leu Thr His His Tyr Phe val Asp Leu val 

290 295 300 

Gly Gly Met Cys Leu Ala lie lie Cys Phe val Phe Ala Gin Lys 

305 310 315 

Leu Arg Leu Pro Gin Leu Gin Thr Gly Lys lie Leu Arg Trp Glu 

320 325 330 

Tyr Glu Phe Val lie His Gly His Gly Leu Ser Glu Lys Thr Ser 

335 340 345 

Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Leu Leu Gly Arg Asp 

350 355 360 

Ser Phe Thr Gin Asn Pro Asn Ala val Ala Phe Met Ser Gly Leu 

365 370 375 

Asn Asn Met Glu Leu Ala Asn Thr Asp His Glu Trp ser val Gly 

380 385 390 

ser ser ser Pro Glu Pro Leu Pro Ser Pro Ala Ala Asp Leu lie 

395 400 405 

Asp Arg Pro Ala Ser Thr Thr Ser Ser lie Phe Asp Ala Ser His 

410 415 420 

Leu Pro 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 2340 base pairs 

(B) TYPE: nucleic acid 
(c) strandedness: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



tttctttctg 


tcaaagaata 


ataaagtgcc catcagtgtt 


catatttgtt 


acaaagtggt 


60 


tttctgattt 


ggtactactg 


cagaggcgta ttttttgctt 


cagttaccat 


agcgtaagaa 


120 


cactagcgac ttttgttcgt 


gaaccaacag agtaggattt 


ctactgctac 


atctcttagg 


180 


tagttggtta gtccgatcgc 


tcacttttgg ttgttgttaa 


gtacttcata 


agtttatcct 


240 


tttccttttt 


cacactgagc 


tactttgggt atagcttttg 


gcccaaggat 


ctttgaattt 


300 


tctccaaaag 


tactttattt 


tatatcctac aggttgcggt 


tttcatattt 


taaaaagctt 


360 


tttaatcatt 


cctttgcgta 


tggcaaaccc tttttcgaga 


tggtttctat 


cagagagacc 


420 


tccaaactgc 


catgtagccg 


atttagaaac aagtttagat 


ccccatcaaa 


cgttgttgaa 


480 


ggtgcaaaaa 


tacaaacccg 


ctttaagcga ctgggtgcat 


tacatcttct 


tgggatccat 


540 


catgctgttt 


gtgttcatta 


ctaatcccgc accttggatc 


ttcaagatcc 


ttttttattg 


600 


tttcttgggc 


actttattca 


tcattccagc tacgtcacag 


tttttcttca 


atgccttgcc 


660 


catcctaaca tgggtggcgc 


tgtatttcac ttcatcgtac 


tttccagatg 


accgcaggcc 


720 


tcctattact 


gtcaaagtgt 


taccagcggt ggaaacaatt 


ttatacggcg 


acaatttaag 


780 


tgatattctt 


gcaacatcga 


cgaattcctt tttggacatt 


ttagcatggt 


taccgtacgg 


840 


actatttcat 


tatggggccc 


catttgtcgt tgctgccatc 


ttattcgtat 


ttggtccacc 


900 


aactgttttg 


caaggttatg 


cttttgcatt tggttatatg 


aacctgtttg 


gtgttatcat 


960 


gcaaaatgtc 


tttccagccg 


ctcccccatg gtataaaatt 


ctctatggat 


tgcaatcagc 


1020 


caactatgat 


atgcatggct 


cgcctggtgg attagctaga 


attgataagc 


tactcggtat 


1080 


taatatgtat 


actacagctt 


tttcaaattc ctccgtcatt 


ttcggtgctt 


ttccttcact 


1140 


gcattccggg 


tgtgctacta 


tggaagccct gtttttctgt 


tattgttttc 


caaaattgaa 


1200 


gcccttgttt 


attgcttatg 


tttgctggtt atggtggtca 


actatgtatc 


tgacacacca 


1260 


ttattttgta gaccttatgg 


caggttctgt gctgtcatac 
34 


gttattttcc 


agtacacaaa 


1320 



gtacacacat 


ttaccaattg 


tagatacatc 


tcttttttgc 


agatggtcat 


acacttcaat 


1380 


tgagaaatac 


gatatatcaa 


agagtgatcc 


attggctgca 


gattcaaacg 


atatcgaaag 


1440 


tgtccctttg 


tccaacttgg 


aacttgactt 


tgatcttaat 


atgactgatg 


aacccagtgt 


1500 


aagcccttcg 


ttatttgatg 


gatctacttc 


tgtttctcgt 


tcgtccgcca 


cgtctataac 


1560 


gtcactaggt 


gtaaagaggg 


cttaatgagt 


attttatctg 


caattacgga 


tacggttggt 


1620 


cttatgtaga 


tacatataaa 


tatatatctt 


tttctttctt 


tttcttagtc 


aggattgtcg 


1680 


tttagcataa 


tatacatgta 


gtttatttaa 


tcacatacca 


ctgattatct 


ttagaatttt 


1740 


ataaattttt 


gaaataaatg 


ggtggctttt 


aatggtgtct 


atgttaagtg 


aggcttttag 


1800 


aatgctcttc 


ctgctttgtt 


tattatatgt 


gtatgaaaga 


tatgtatgta 


tttacatgtg 


1860 


tttgtagcgt 


ccccagtcaa 


aacctgtgcg 


ctatacctaa 


atggattgat 


aatcttcatt 


1920 


cactaattct 


aaaatagact 


tcttccccaa 


agaacggtgt 


aacgatgagg 


ctctatccag 


1980 


ctgcttatct 


aaatcaactt 


taacgatgga 


tgatcttatg 


acacggggat 


ctttctttaa 


2040 


agttcttaga 


atttcagact 


gtaccgcagc 


tgatgaatca 


aacagcatta 


aaaagtgata 


2100 


tgctcgaaaa 


tgtttttcct 


ggtctttctt 


cattatttta 


ggaagatacc 


ttatgcccat 


2160 


gggtacaatg 


tccctcacca 


cacctctgtt 


ttgaataatc 


agtttcccga 


ttgtggaaga 


2220 


caattctttt 


gcttccaact 


ttggcgcatt 


ggagttggtt 


atgcgaacaa 


gtccgatcag 


2280 


ctcataaagc 


atcttagtga 


aaagggtggt 


tttgcgttat 


tctttcctct 


gttgaagctt 


2340 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) type: amino acid 

(c) strandedness: single 
(D) topology: linear 

(ii) molecule type: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 



1 



5 



10 



15 



Asn 



Cys His Val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 



20 



25 



30 



Thr 



Leu Leu Lys Val Gin Lys Tyr Lys Pro Ala Leu Ser Asp Trp 

35 



35 40 45 

Val His Tyr lie Phe Leu Gly Ser lie Met Leu Phe val Phe lie 

50 55 60 

Thr Asn Pro Ala Pro Trp lie Phe Lys lie Leu Phe Tyr Cys Phe 

65 70 75 

Leu Gly Thr Leu Phe lie lie Pro Ala Thr ser Gin Phe Phe Phe 

80 85 90 
Asn Ala Leu Pro He Leu Thr Trp val Ala Leu Tyr Phe Thr Ser 

95 100 105 
Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro lie Thr val Lys val 

110 115 120 

Leu Pro Ala val Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asp 

125 130 135 

lie Leu Ala Thr Ser Thr Asn ser Phe Leu Asp lie Leu Ala Trp 

140 145 150 

Leu Pro Tyr Gly Leu Phe His Tyr Gly Ala Pro Phe Val Val Ala 

155 160 165 

Ala lie Leu Phe val Phe Gly Pro Pro Thr val Leu Gin Gly Tyr 

170 175 180 

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly val lie Met Gin 

185 190 195 

Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Lys lie Leu Tyr Gly 

200 205 210 

Leu Gin Ser Ala Asn Tyr Asp Met His Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Lys Leu Leu Gly lie Asn Met Tyr Thr Thr Ala 

230 235 240 

Phe Ser Asn ser ser val lie Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe lie Ala Tyr val Cys Trp Leu Trp 
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275 280 285 

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe val Asp Leu Met 

290 295 300 

Ala Gly Ser Val Leu Ser Tyr val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie val Asp Thr Ser Leu Phe Cys Arg Trp Ser 

320 325 330 

Tyr Thr ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu Ser Val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser val Ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr ser val Ser Arg Ser Ser Ala 

380 385 390 

Thr Ser lie Thr Ser Leu Gly val Lys Arg Ala 

395 400 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 2340 base pairs 

(B) type: nucleic acid 
(c) strandedness: double 
(D) topology: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

tttctttctg tcaaagaata ataaagtgcc catcagtgtt catatttgtt acaaagtggt 60 

tttctgattt ggtactactg cagaggcgta ttttttgctt cagttaccat agcgtaagaa 120 

cactagcgac ttttgttcgt gaaccaacag agtaggattt ctactgctac atctcttagg 180 

tagttggtta gtccgatcgc tcacttttgg ttgttgttaa gtacttcata agtttatcct 240 

tttccttttt cacactgagc tactttgggt atagcttttg gcccaaggat ctttgaattt 300 

37 




tctccaaaag 


tactttattt 


tatatcctac 


aggttgcggt 


tttcatattt 


taaaaagctt 


360 


tttaatcatt 


cctttgcgta 


tggcaaaccc 


tttttcgaga 


tggtttctat 


cagagagacc 


420 


tccaaactgc 


catgtagccg 


atttagaaac 


aagtttagat 


ccccatcaaa 


cgttgttgaa 


480 


ggtgcaaaaa 


tacaaacccg 


ctttaagcga 


ctgggtgcat 


tacatcttct 


tgggatccat 


540 


catgctgttt 


gtgttcatta 


ctaatcccgc 


accttggatc 


ttcaagatcc 


ttttttattg 


600 


tttcttgggc 


actttattca 


tcattccagc 


tacgtcacag 


tttttcttca 


atgccttgcc 


660 


catcctaaca 


tgggtggcgc 


tgtatttcac 


ttcatcgtac 


tttccagatg 


accgcaggcc 


720 


tcctattact 


gtcaaagtgt 


taccagcggt 


ggaaacaatt 


ttatacggcg 


acaatttaag 


780 


tgatattctt 


gcaacatcga 


cgaattcctt 


tttggacatt 


ttagcatggt 


taccgtacgg 


840 


actatttcat 


tttggggccc 


catttgtcgt 


tgctgccatc 


ttattcgtat 


ttggtccacc 


900 


aactgttttg 


caaggttatg 


cttttgcatt 


tggttatatg 


aacctgtttg 


gtgttatcat 


960 


gcaaaatgtc 


tttccagccg 


ctcccccatg 


gtataaaatt 


ctctatggat 


tgcaatcagc 


1020 


caactatgat 


atgcatggct 


cgcctggtgg 


attagctaga 


attgataagc 


tactcggtat 


1080 


taatatgtat 


actacagctt 


tttcaaattc 


ctccgtcatt 


ttcggtgctt 


ttccttcact 


1140 


gcattccggg 


tgtgctacta 


tggaagccct 


gtttttctgt 


tattgttttc 


caaaattgaa 


1200 


gcccttgttt 


attgcttatg 


tttgctggtt 


atggtggtca 


actatgtatc 


tgacacacca 


1260 


ttattttgta 


gaccttatgg 


caggttctgt 


gctgtcatac 


gttattttcc 


agtacacaaa 


1320 


gtacacacat 


ttaccaattg 


tagatacatc 


tcttttttgc 


agatggtcat 


acacttcaat 


1380 


tgagaaatac 


gatatatcaa 


agagtgatcc 


attggctgca 


gattcaaacg 


atatcgaaag 


1440 


tgtccctttg 


tccaacttgg 


aacttgactt 


tgatcttaat 


atgactgatg 


aacccagtgt 


1500 


aagcccttcg 


ttatttgatg 


gatctacttc 


tgtttctcgt 


tcgtccgcca 


cgtctataac 


1560 


gtcactaggt 


gtaaagaggg 


cttaatgagt 


attttatctg 


caattacgga 


tacggttggt 


1620 


cttatgtaga 


tacatataaa 


tatatatctt 


tttctttctt 


tttcttagtc 


aggattgtcg 


1680 


tttagcataa 


tatacatgta 


gtttatttaa 


tcacatacca 


ctgattatct 


ttagaatttt 


1740 


ataaattttt 


gaaataaatg 


ggtggctttt 


aatggtgtct 


atgttaagtg 


aggcttttag 


1800 


aatgctcttc 


ctgctttgtt 


tattatatgt 


gtatgaaaga 


tatgtatgta 


tttacatgtg 


1860 


tttgtagcgt 


ccccagtcaa 


aacctgtgcg 


ctatacctaa 


atggattgat 


aatcttcatt 


1920 


cactaattct 


aaaatagact 


tcttccccaa 


agaacggtgt 


aacgatgagg 


ctctatccag 


1980 


ctgcttatct 


aaatcaactt 


taacgatgga 


tgatcttatg 


acacggggat 


ctttctttaa 


2040 


agttcttaga 


atttcagact 


gtaccgcagc 


tgatgaatca 


aacagcatta 


aaaagtgata 


2100 


tgctcgaaaa 


tgtttttcct 


ggtctttctt 


cattatttta 


ggaagatacc 


ttatgcccat 


2160 


gggtacaatg 


tccctcacca 


cacctctgtt 


ttgaataatc 
38 


agtttcccga 


ttgtggaaga 


2220 



caattctttt gcttccaact ttggcgcatt ggagttggtt atgcgaacaa gtccgatcag 2280 
ctcataaagc atcttagtga aaagggtggt tttgcgttat tctttcctct gttgaagctt 2340 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 

15 10 15 

Asn Cys His val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 

20 25 30 

Thr Leu Leu Lys Val Gin Lys Tyr Lys Pro Ala Leu Ser Asp Trp 

35 40 45 

val His Tyr lie Phe Leu Gly Ser lie Met Leu Phe val Phe lie 

50 55 60 

Thr Asn Pro Ala Pro Trp lie Phe Lys lie Leu Phe Tyr Cys Phe 

65 70 75 

Leu Gly Thr Leu Phe lie lie Pro Ala Thr Ser Gin Phe Phe Phe 

80 85 90 

Asn Ala Leu Pro lie Leu Thr Trp val Ala Leu Tyr Phe Thr Ser 

95 100 105 

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro lie Thr Val Lys val 

110 115 120 

Leu Pro Ala val Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asp 

125 130 135 

lie Leu Ala Thr Ser Thr Asn Ser Phe Leu Asp lie Leu Ala Trp 

140 145 150 
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Leu Pro Tyr Gly Leu Phe His Phe Gly Ala Pro Phe val val Ala 

155 160 165 

Ala lie Leu Phe val Phe Gly Pro Pro Thr val Leu Gin Gly Tyr 

170 175 180 

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly val lie Met Gin 

185 190 195 

Asn val Phe Pro Ala Ala Pro Pro Trp Tyr Lys lie Leu Tyr Gly 

200 205 210 

Leu Gin ser Ala Asn Tyr Asp Met His Gly ser Pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Lys Leu Leu Gly lie Asn Met Tyr Thr Thr Ala 

230 235 240 

Phe Ser Asn ser ser val lie Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe cys Tyr Cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe lie Ala Tyr val Cys Trp Leu Trp 

275 280 285 

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe val Asp Leu Met 

290 295 300 

Ala Gly Ser val Leu Ser Tyr Val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie val Asp Thr ser Leu Phe Cys Arg Trp ser 

320 325 330 

Tyr Thr Ser lie Glu Lys Tyr Asp lie ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu ser val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser val Ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser val Ser Arg Ser Ser Ala 



380 



40 



385 



390 




Thr Ser lie Thr Ser Leu Gly Val Lys Arg Ala 
395 400 

(2) INFORMATION FOR SEQ ID NO: 23: 
(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic dna 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



agcgcttcta 


ttttcctccc 


caccgcgagg 


cggaaatggc 


acattttttt 


tcttttgctt 


60 


ctgtgctttt 


gctgtaattt 


ttggcatgtg 


ctattgtatg 


aagataacgc 


gtggttccgt 


120 


ggaaatagcc 


ggaaattttg 


ccgggaatat 


gacggacatg 


atttaacacc 


cgtggaaatg 


180 


aaaaaagcca 


aggtaagaaa 


gtggcaatat 


ttttcctaca 


aatagatctg 


ctgtccctta 


240 


gatgattacc 


atacatatat 


atatttatta 


cacacatctg 


tcagaggtag 


ctagcgaagg 


300 


tgtcactgaa 


atattttttg 


ttccagttag 


tataaatacg 


gaggtagaac 


agctctccgc 


360 


gtgtatatct 


ttttttgcgc 


tatacaagaa 


caggaagaac 


gcatttccat 


acctttttct 


420 


ccttacaggt 


gccctctgag 


tagtgtcacg 


aacgaggaaa 


aagattaata 


ttactgtttt 


480 


tatattcaaa 


aagagtaaag 


ccgttgctat 


atacgaatat 


gacgattacc 


gtgggggatg 


540 


cagtttcgga 


gacggagctg 


gaaaacaaaa 


gtcaaaacgt 


ggtactatct 


cccaaggcat 


600 


ctgcttcttc 


agacataagc 


acagatgttg 


ataaagacac 


atcgtcttct 


tgggatgaca 


660 


aatctttgct 


gcctacaggt 


gaatatattg 


tggacagaaa 


taagccccaa 


acctacttga 


720 


atagcgatga 


tatcgaaaaa 


gtgacagaat 


ctgatatttt 


ccctcagaaa 


cgtctgtttt 


780 


cattcttgca 


ctctaagaaa 


attccagaag 


taccacaaac 


cgatgacgag 


aggaagatat 


840 


atcctctgtt 


ccatacaaat 


attatctcta 


acatgttttt 


ttggtgggtt 


ctacccatcc 


900 


tgcgagttgg 


ttataagaga 


acgatacagc 


cgaacgatct 


cttcaaaatg 


gatccgagga 


960 


tgtctataga 


gaccctttat 


gacgactttg 


aaaaaaacat 


gatttactat 


tttgagaaga 


1020 


cgaggaaaaa 


ataccgtaaa 


agacatccag 


aagcgacaga 


agaagaggtt 


atggaaaatg 


1080 


ccaaactacc 


taaacataca 


gttctgagag 


ctttattatt 


cacttttaag 


aaacagtact 


1140 


tcatgtcgat 


agtgtttgca 


attctcgcta 


attgtacatc 
41 


cggttttaac 


cccatgatta 


1200 



ccaagaggct 


aattgagttt 


gtcgaagaaa 


aaggtattgg 


ttacgctatt 


ggtgcatgtt 


atcatttctt 


tcatacatcc 


caactgactg 


ctgccatgaa 


gaaaatgttt 


aatgcatcta 


aagtgacttc 


ttttgtaaca 


acagatctcg 


cgtttttggc 


tgggttccct 


gcaattttgg 


ttggacccat 


tgccttagtt 


gggattggta 


ttgcatttaa 


gttaattctg 


ggctttagaa 


ttaccatgat 


gagagaagtg 


ctgaataata 


atgcgtatga 


aaaaaatatt 


caagatatta 


tgcaactatc 


aagaaatttc 


ttgattgcta 


tggtcacttt 


ccttgcaatg 


tacaaagtta 


ttgcctcttt 


atctttattt 


caggtcttga 


ttggtactgg 


aattgacatg 


atcattggat 


cagaagatga 


tccaaatcag 


atgattgaaa 


tggctctaaa 


aatgacacat 


tgctcatttg 


ttgaagaagc 


aaaaggagaa 


gctaaagatg 


atacatgggg 


taagccatct 


gcaagtacta 


aagacagaga 


cggcccggaa 


gatttagaaa 


acttcgatat 


taaaaagggc 


gaatttatta 


cttcattatt 


gaatgcgatg 


gcaggatcaa 


acggggactt 


attaatgtgt 


ggttatccat 


tcatattcgg 


ttcaccattc 


aataaagaaa 


tgaaagctga 


tctggatatt 


ttaccggcag 


ttactttatc 


tggtggtcaa 


aaggcacgta 


aggatattta 


tgtattcgac 


gatgtcctaa 


tcatggatga 


atgtctaacc 


ggaatgcttg 


agttgtcact 


qattqaqaaa 


acttctaaaa 


atattggtac 


tgttgatgag 


ctaaaagctc 


tctcttctca 


aaattcggag 


aaagaggatg 


tgggacaact 


aaaatatgaa 


ccagaggtaa 


cagaaatgtc 


acaaactgca 


aatagtggta 



aggctatttt 


tcatagcatg 


catgttaaca 


1260 


tgatgatgtt 


cgttaacggg 


ttgacgttca 


1320 


gtgtgcaagc 


taagtctatt 


cttactaaag 


1380 


attatgcgag 


acattgtttt 


cctaacggta 


1440 


ctagaattga 


atttgcctta 


tcttttcagc 


1500 


ctatttgcat 


tgttttattg 


atcgttaacc 


1560 


tttttttcgg 


tgggtttttc 


atatccttat 


1620 


ttgctgcgaa 


catcttcact 


gatgctagag 


1680 


taaaaatgat 


taaatattat 


acgtgggagg 


1740 


ggaccaaaga 


gatttctaaa 


gttagaaaaa 


1800 


tggccatgtc 


tttgcctagt 


attgcttcat 


1860 


ataaaggagg 


caggcaacct 


ggtaatattt 


1920 


gtttgcaaat 


gtttttctta 


cctattgcta 


1980 


tgggccgttt 


gcaaagctta 


ttggaggctc 


2040 


tgaagccctc 


tcctggcttt 


gatccaaaat 


2100 


agtgggaaga 


ttatgaatta 


aacgacgcta 


2160 


aaggtaaaaa 


gaacaaaaaa 


aagcgtaagg 


2220 


ataaggcgaa 


aagattggac 


aatatgttga 


2280 


aaacttcgtt 


taggggtttc 


aaggacttga 


2340 


tgattacggg 


acctattggt 


actggtaaat 


2400 


tgagaaaaat 


tgatggtaag 


gttgaagtca 


2460 


ggattcaaaa 


tgcatctgta 


agagataaca 


2520 


agtatgatga 


agtagttcgt 


gtttgctctt 


2580 


gcgatatgac 


cgaaattggg 


gaacgtggta 


2640 


tcaatttagc 


caggtctgtt 


tataagaaga 


2700 


gtgctgtcga 


ttctcgtgtt 


ggtaaacaca 


2760 


ctaataaaac 


cagaatttta 


gcaacgcatc 


2820 


tcatcgtttt 


aggtactgat 


ggccaagtcg 


2880 


gtaatcaaac 


tttgataaat 


cttttacaat 


2940 


aagaacagga 


agcggttgtt 


tccggtgaat 


3000 


aggaattgac 


tgaactgaag 


aaaaaggcta 


3060 


aaattgtagc 
42 


ggatggtcat 


actagtagta 


3120 



aagaagaaag 


agcagtcaat 


agtatcagtc 


cagtaggtaa 


gtggggtttt 


atcgcactac 


cattctgctc 


acttttttct 


tccgtttggt 


acagaccacc 


cagtttttat 


atgggtcttt 


tcatgaatgg 


ccagttcacc 


atactttgcg 


atttgagggc 


tgtgaaaaga 


attttacaca 


tgggacgtat 


tctgaacaga 


ttcacaaaag 


aaagtttacg 


gttgatgaca 


tctcaatttg 


ttgtttactt 


gccgtggttt 


gctatcgcaa 


ttgctgatca 


ttatcagagt 


tctggtagag 


cttttgttta 


caataattta 


aatgaagttt 


gaagtcagga 


acgatttttg 


gcgaaatcag 


gataccttgt 


agttgtcctg 


caaagatggg 


catttgcact 


aattattacg 


ttattgtgtg 


cagttggtgt 


tttgttgact 


tatgtattac 


gggcaatgac 


tcaaacagag 


aatgacatga 


ctgaactacc 


actagaggca 


tcctatagaa 


cctcaatggg 


cgaaataatt 


tttgaaaatg 


tagttttaaa 


aaatcttaac 


ttgaatatca 


gtacaggtgc 


tggtaagtcc 


actattatga 


caggtaaaat 


tttaattgac 


aatgttgata 


gaaaattagc 


catcattcca 


caagatccag 


tagatccatt 


taatgagcgt 


acagatgacg 


ctatcgccaa 


ggatgacttg 


ccggaagtga 


atggtaaaat 


gcataagttc 


catttagatc 


ccttaggtga 


gagacaacta 


ttagcattaa 


tgattttgga 


tgaggctaca 


tcctcagtgg 


gtattgttga 


ggaatttgga 


gattgtacaa 


ttgtaaatta 


tgatcgtatt 


cttgttttag 


catggacgtt 


gtttagtcaa 


gaagatagta 


ttgtggaaaa 


tgatttcgag 


aacagaagtt 


cttttattta 


tttatatgtt 


gccgatggta 



tgaaaatata 


ccgtgaatac 


attaaagctg 


3180 


cgttgtatgc 


aattttagtc 


gttggaacca 


3240 


tatcttactg 


gactgagaat 


aaattcaaaa 


3300 


actccttctt 


tgtgtttgct 


gctttcatat 


3360 


caatgggtat 


tatggcatcg 


aaatggttaa 


3420 


ctccaatgtc 


atacatagat 


accacacctt 


3480 


atacagatag 


cttagataat 


gagttaaccg 


3540 


ctaatattgt 


aggtgtttgc 


gtcatgtgta 


3600 


ttccgtttct 


tttggtcatc 


tttgttctga 


3660 


aaattaaaag 


acttgaagct 


gtgcaacggt 


3720 


tgggtgggat 


ggatacaatc 


aaagcatacc 


3780 


attttttgat 


caacaagatg 


aatgaggcgg 


3840 


taggtatttt 


ccttgatatg 


gttgctatcg 


3900 


ttacgagagc 


ctttcctatt 


tccgcggctt 


3960 


aattgcctgg 


tctattaaat 


accattttaa 


4020 


atagtgccga 


aagattggta 


acatatgcaa 


4080 


agcccgaaat 


gacacctcca 


gagtcatggc 


4140 


ttgattttgc 


ctatagacct 


ggtttaccta 


4200 


agagtgggga 


aaaaattggt 


atctgtggtc 


4260 


gtgcccttta 


caggttgaat 


gaattgaccg 


4320 


taagtcagct 


gggacttttc 


gatttaagaa 


4380 


tattatttag 


gggtacgatt 


cgcaagaact 


4440 


aattatggga 


tgcattggtg 


agaggtggtg 


4500 


aattgcaaaa 


acctgatgaa 


aatggtactc 


4560 


aagcagtgga 


agaagagggc 


tccaatttct 


4620 


caagggcatt 


ggtccgccaa 


tcaaaaatat 


4680 


actacgaaac 


ggatggcaaa 


atccaaacac 


4740 


ttttgtgtat 


tgctcacaga 


ctgaagacca 


4800 


agaagggtga 


agtcgcagaa 


ttcgatacac 


4860 


ttttcagaag 


catgtgttct 


agatctggta 


4920 


aatttatatt 


atttgttgca 


tgatttttct 


4980 


caaattagta 
43 


ctagaaaaga 


aaacccacta 


5040 



ctatgacttg cagaaaaagt tatgtgtgcc atagatagat ataattgcat acccacatcg 5100 

tatactcaaa attccgaaaa gaacatttca ttttttatga ggcaaactga acaacgcttc 5160 

ggtccttttt tcattctaga aatatatatt tatacatcat tttcagaaga tattcaaaga 5220 

acttattggg atgtctattt actgaataaa gtatacacaa aaaacgaatt taaaatggaa 5280 

ggcataaata gaaaacttag aagtgaaaat cctaaaaccg aaggatattt caaatacgta 5340 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1477 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) molecule type: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Thr He Thr Val Gly Asp Ala Val Ser Glu Thr Glu Leu Glu 

5 10 15 

Asn Lys Ser Gin Asn Val val Leu Ser Pro Lys Ala Ser Ala Ser 

20 25 30 

Ser Asp lie Ser Thr Asp val Asp Lys Asp Thr Ser Ser Ser Trp 

35 40 45 

Asp Asp Lys Ser Leu Leu Pro Thr Gly Glu Tyr lie val Asp Arg 

50 55 60 

Asn Lys Pro Gin Thr Tyr Leu Asn Ser Asp Asp lie Glu Lys Val 

65 70 75 

Thr Glu Ser Asp lie Phe Pro Gin Lys Arg Leu Phe Ser Phe Leu 

80 85 90 

His Ser Lys Lys lie Pro Glu Val Pro Gin Thr Asp Asp Glu Arg 

95 100 105 

Lys lie Tyr Pro Leu Phe His Thr Asn lie lie ser Asn Met Phe 

110 115 120 

Phe Trp Trp val Leu Pro lie Leu Arg Val Gly Tyr Lys Arg Thr 
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125 130 135 

lie Gin Pro Asn Asp Leu Phe Lys Met Asp Pro Arg Met Ser lie 

140 145 150 

Glu Thr Leu Tyr Asp Asp Phe Glu Lys Asn Met lie Tyr Tyr Phe 

155 160 165 

Glu Lys Thr Arg Lys Lys Tyr Arg Lys Arg His Pro Glu Ala Thr 

170 175 180 

Glu Glu Glu val Met Glu Asn Ala Lys Leu Pro Lys His Thr val 

185 190 195 

Leu Arg Ala Leu Leu Phe Thr Phe Lys Lys Gin Tyr Phe Met Ser 

200 205 210 

lie val Phe Ala lie Leu Ala Asn cys Thr Ser Gly Phe Asn Pro 

215 220 225 

Met lie Thr Lys Arg Leu lie Glu Phe val Glu Glu Lys Ala lie 

230 235 240 

Phe His ser Met His val Asn Lys Gly He Gly Tyr Ala lie Gly 

245 250 255 

Ala Cys Leu Met Met Phe val Asn Gly Leu Thr Phe Asn His Phe 

260 265 270 

Phe His Thr Ser Gin Leu Thr Gly Val Gin Ala Lys Ser lie Leu 

275 280 285 

Thr Lys Ala Ala Met Lys Lys Met Phe Asn Ala Ser Asn Tyr Ala 

290 295 300 

Arg His cys Phe Pro Asn Gly Lys val Thr Ser Phe val Thr Thr 

305 310 315 

Asp Leu Ala Arg lie Glu Phe Ala Leu Ser Phe Gin Pro Phe Leu 

320 325 330 

Ala Gly Phe Pro Ala lie Leu Ala lie cys lie val Leu Leu lie 

335 340 345 

val Asn Leu Gly Pro lie Ala Leu val Gly lie Gly lie Phe Phe 

350 355 360 

Gly Gly Phe Phe lie ser Leu Phe Ala Phe Lys Leu lie Leu Gly 
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365 370 375 

Phe Arg lie Ala Ala Asn lie Phe Thr Asp Ala Arg Val Thr Met 

380 385 390 

Met Arg Glu val Leu Asn Asn He Lys Met lie Lys Tyr Tyr Thr 

395 400 405 

Trp Glu Asp Ala Tyr Glu Lys Asn lie Gin Asp lie Arg Thr Lys 

410 415 420 

Glu lie Ser Lys Val Arg Lys Met Gin Leu Ser Arg Asn Phe Leu 

425 430 435 

lie Ala Met Ala Met Ser Leu Pro Ser lie Ala ser Leu val Thr 

440 445 450 

Phe Leu Ala Met Tyr Lys val Asn Lys Gly Gly Arg Gin Pro Gly 

455 460 465 

Asn He Phe Ala Ser Leu Ser Leu Phe Gin val Leu Ser Leu Gin 

470 475 480 

Met Phe Phe Leu Pro lie Ala lie Gly Thr Gly lie Asp Met lie 

485 490 495 

lie Gly Leu Gly Arg Leu Gin Ser Leu Leu Glu Ala Pro Glu Asp 

500 505 510 

Asp Pro Asn Gin Met lie Glu Met Lys Pro ser Pro Gly Phe Asp 

515 520 525 

Pro Lys Leu Ala Leu Lys Met Thr His Cys Ser Phe Glu Trp Glu 

530 535 540 

Asp Tyr Glu Leu Asn Asp Ala lie Glu Glu Ala Lys Gly Glu Ala 

545 550 555 

Lys Asp Glu Gly Lys Lys Asn Lys Lys Lys Arg Lys Asp Thr Trp 

560 565 570 

Gly Lys Pro Ser Ala Ser Thr Asn Lys Ala Lys Arg Leu Asp Asn 

575 580 585 

Met Leu Lys Asp Arg Asp Gly Pro Glu Asp Leu Glu Lys Thr ser 

590 595 600 

Phe Arg Gly Phe Lys Asp Leu Asn Phe Asp lie Lys Lys Gly Glu 
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605 610 615 

Phe lie Met lie Thr Gly Pro lie Gly Thr Gly Lys Ser ser Leu 

620 625 630 

Leu Asn Ala Met Ala Gly Ser Met Arg Lys lie Asp Gly Lys Val 

635 640 645 

Glu val Asn Gly Asp Leu Leu Met Cys Gly Tyr Pro Trp lie Gin 

650 655 660 

Asn Ala ser val Arg Asp Asn lie lie Phe Gly Ser Pro Phe Asn 

665 670 675 

Lys Glu Lys Tyr Asp Glu Val val Arg Val Cys Ser Leu Lys Ala 

680 685 690 

Asp Leu Asp lie Leu Pro Ala Gly Asp Met Thr Glu lie Gly Glu 

695 700 705 

Arg Gly lie Thr Leu Ser Gly Gly Gin Lys Ala Arg lie Asn Leu 

710 715 720 

Ala Arg Ser Val Tyr Lys Lys Lys Asp lie Tyr Val Phe Asp Asp 

725 730 735 

val Leu Ser Ala val Asp Ser Arg val Gly Lys His lie Met Asp 

740 745 750 

Glu Cys Leu Thr Gly Met Leu Ala Asn Lys Thr Arg lie Leu Ala 

755 760 765 

Thr His Gin Leu Ser Leu lie Glu Arg Ala Ser Arg Val lie Val 

770 775 780 

Leu Gly Thr Asp Gly Gin val Asp lie Gly Thr val Asp Glu Leu 

785 790 795 

Lys Ala Arg Asn Gin Thr Leu lie Asn Leu Leu Gin Phe Ser Ser 

800 805 810 

Gin Asn Ser Glu Lys Glu Asp Glu Glu Gin Glu Ala Val val Ser 

815 820 825 

Gly Glu Leu Gly Gin Leu Lys Tyr Glu Pro Glu Val Lys Glu Leu 

830 835 840 

Thr Glu Leu Lys Lys Lys Ala Thr Glu Met ser Gin Thr Ala Asn 
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845 850 855 
Ser Gly Lys lie val Ala Asp Gly His Thr ser ser Lys Glu Glu 

860 865 870 

Arg Ala val Asn Ser lie Ser Leu Lys lie Tyr Arg Glu Tyr lie 

875 880 885 

Lys Ala Ala Val Gly Lys Trp Gly Phe lie Ala Leu Pro Leu Tyr 

890 895 900 

Ala lie Leu val val Gly Thr Thr Phe cys Ser Leu Phe Ser ser 

905 910 915 

Val Trp Leu Ser Tyr Trp Thr Glu Asn Lys Phe Lys Asn Arg Pro 

920 925 930 

Pro Ser Phe Tyr Met Gly Leu Tyr Ser Phe Phe val Phe Ala Ala 

935 940 945 

Phe lie Phe Met Asn Gly Gin Phe Thr lie Leu Cys Ala Met Gly 

950 955 960 

lie Met Ala ser Lys Trp Leu Asn Leu Arg Ala val Lys Arg He 

965 970 975 
Leu His Thr Pro Met Ser Tyr lie Asp Thr Thr Pro Leu Gly Arg 

980 985 990 

lie Leu Asn Arg Phe Thr Lys Asp Thr Asp Ser Leu Asp Asn Glu 

995 1000 1005 

Leu Thr Glu Ser Leu Arg Leu Met Thr Ser Gin Phe Ala Asn lie 

1010 1015 1020 

val Gly val cys val Met Cys lie val Tyr Leu Pro Trp Phe Ala 

1025 1030 1035 

lie Ala lie Pro Phe Leu Leu val lie Phe val Leu lie Ala Asp 

1040 1045 1050 

His Tyr Gin Ser ser Gly Arg Glu lie Lys Arg Leu Glu Ala val 

1055 1060 1065 

Gin Arg ser Phe Val Tyr Asn Asn Leu Asn Glu val Leu Gly Gly 

1070 1075 1080 

Met Asp Thr lie Lys Ala Tyr Arg Ser Gin Glu Arg Phe Leu Ala 
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1085 1090 1095 

Lys Ser Asp Phe Leu lie Asn Lys Met Asn Glu Ala Gly Tyr Leu 

1100 1105 1110 

val val val Leu Gin Arg Trp val Gly lie Phe Leu Asp Met val 

1115 1120 1125 

Ala lie Ala Phe Ala Leu lie lie Thr Leu Leu cys val Thr Arg 

1130 1135 1140 

Ala Phe Pro lie Ser Ala Ala Ser Val Gly val Leu Leu Thr Tyr 

1145 1150 1155 

Val Leu Gin Leu Pro Gly Leu Leu Asn Thr lie Leu Arg Ala Met 

1160 1165 1170 

Thr Gin Thr Glu Asn Asp Met Asn ser Ala Glu Arg Leu val Thr 

1175 1180 1185 

Tyr Ala Thr Glu Leu Pro Leu Glu Ala Ser Tyr Arg Lys Pro Glu 

1190 1195 1200 

Met Thr Pro Pro Glu Ser Trp Pro Ser Met Gly Glu lie lie Phe 

1205 1210 1215 

Glu Asn val Asp Phe Ala Tyr Arg Pro Gly Leu Pro lie val Leu 

1220 1225 1230 

Lys Asn Leu Asn Leu Asn lie Lys Ser Gly Glu Lys lie Gly lie 

1235 1240 1245 

Cys Gly Arg Thr Gly Ala Gly Lys Ser Thr He Met Ser Ala Leu 

1250 1255 1260 

Tyr Arg Leu Asn Glu Leu Thr Ala Gly Lys lie Leu lie Asp Asn 

1265 1270 1275 

val Asp lie Ser Gin Leu Gly Leu Phe Asp Leu Arg Arg Lys Leu 

1280 1285 1290 

Ala lie lie Pro Gin Asp Pro val Leu Phe Arg Gly Thr lie Arg 

1295 1300 1305 

Lys Asn Leu Asp Pro Phe Asn Glu Arg Thr Asp Asp Glu Leu Trp 

1310 1315 1320 

Asp Ala Leu Val Arg Gly Gly Ala lie Ala Lys Asp Asp Leu Pro 
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1325 1330 1335 

Glu val Lys Leu Gin Lys Pro Asp Glu Asn Gly Thr His Gly Lys 

1340 1345 1350 

Met His Lys Phe His Leu Asp Gin Ala val Glu Glu Glu Gly Ser 

1355 1360 1365 

Asn Phe Ser Leu Gly Glu Arg Gin Leu Leu Ala Leu Thr Arg Ala 

1370 1375 1380 

Leu Val Arg Gin Ser Lys lie Leu lie Leu Asp Glu Ala Thr Ser 

1385 1390 1395 

Ser val Asp Tyr Glu Thr Asp Gly Lys lie Gin Thr Arg lie val 

1400 1405 1410 

Glu Glu Phe Gly Asp Cys Thr lie Leu Cys lie Ala His Arg Leu 

1415 1420 1425 

Lys Thr lie Val Asn Tyr Asp Arg lie Leu val Leu Glu Lys Gly 

1430 1435 1440 

Glu val Ala Glu Phe Asp Thr Pro Trp Thr Leu Phe Ser Gin Glu 

1445 1450 1455 

Asp Ser lie Phe Arg ser Met Cys Ser Arg ser Gly lie val Glu 

1460 1465 1470 

Asn Asp Phe Glu Asn Arg Ser 

1475 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 26 bases 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(ii) MOLECULE type: other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



tttggttaya tgaayytntt yggngt 26 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) molecule type: other nucleic acid (synthetic dna) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

tctacaaart artggtgngt narrtacat 29 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 2274 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



ttatatatat 


tattgatttg 


ttcctgttgt tatttagttt 


agaatcagac 


gactacacca 


60 


gaaccacaat 


tcaaccaaca 


cttatataga acctggcttg 


gaaaaaagta 


acatttatca 


120 


ttcctatact 


tttttagcaa acataatccg tgttttacat 


atattattca 


cccaatatca 


180 


taacaaaaac 


aaactgaata 


atggcgtctt ctattttgcg 


ttccaaaata 


atacaaaaac 


240 


cgtaccaatt 


attccactac 


tattttcttc tggagaaggc 


tcctggttct 


acagttagtg 


300 


atttgaattt 


tgatacaaac 


atacaaacga gtttacgtaa 


attaaagcat 


catcattgga 


360 


cggtgggaga 


aatattccat 


tatgggtttt tggtttccat 


actttttttc 


gtgtttgtgg 


420 


ttttcccagc 


ttcatttttt 


ataaaattac caataatctt 


agcatttgct 


acttgttttt 


480 


taataccctt 


aacatcacaa 


ttttttcttc ctgccttgcc 


cgttttcact 


tggttggcat 


540 


tatattttac 


gtgtgctaaa atacctcaag aatggaaacc 


agctatcaca 


gttaaagttt 


600 


taccagctat 


ggaaacaatt 


ttgtacggcg ataatttatc 
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aaatgttttg 


gcaaccatca 


660 



ctaccggagt 


gttagatata 


ttggcatggt 


taccatatgg 


gattattcat 


ttcagtttcc 


720 


catttgtact 


tgctgctatt 


atatttttat 


ttgggccacc 


gacggcatta 


agatcatttg 


780 


gatttgcctt 


tggttatatg 


aacttgcttg 


gagtcttgat 


tcaaatggca 


ttcccagctg 


840 


ctcctccatg 


gtacaaaaac 


ttgcacggat 


tagaaccagc 


taattattca 


atgcacgggt 


900 


ctcctggtgg 


acttggaagg 


atagataaat 


tgttaggtgt 


tgatatgtat 


accacagggt 


960 


tttccaattc 


atcaatcatt 


tttggggcat 


tcccatcgtt 


acattcagga 


tgttgtatca 


1020 


tggaagtgtt 


atttttgtgt 


tggttgtttc 


cacgattcaa 


gtttgtgtgg 


gttacatacg 


1080 


catcttggct 


ttggtggagc 


acgatgtatt 


tgacccatca 


ctactttgtc 


gatttgattg 


1140 


gtggagccat 


gctatctttg 


actgtttttg 


agttcaccaa 


atataaatat 


ttgccaaaaa 


1200 


acaaagaagg 


ccttttctgt 


cgttggtcat 


acactgaaat 


tgaaaaaatc 


gatatccaag 


1260 


agattgaccc 


tttatcatac 


aattatatcc 


ctgtcaacag 


caatgataat 


gaaagcagat 


1320 


tgtatacgag 


agtgtaccaa 


gagtctcagg 


ttagtccccc 


acagagagct 


gaaacacctg 


1380 


aagcatttga 


gatgtcaaat 


ttttctaggt 


ctagacaaag 


ctcaaagact 


caggttccat 


1440 


tgagtaatct 


tactaacaat 


gatcaagtgt 


ctggaattaa 


cgaagaggat 


gaagaagaag 


1500 


aaggcgatga 


aatttcatcg 


agtactcctt 


cggtgtttga 


agacgaacca 


cagggtagca 


1560 


catatgctgc 


atcctcagct 


acatcagtag 


atgatttgga 


ttccaaaaga 


aattagtaaa 


1620 


ataacagttt 


ctattaattt 


ctttatttcc 


tcctaattaa 


tgattttatg 


ctcaatacct 


1680 


acactatctg 


tttttaattt 


cctacttttt 


ttttattatt 


gttgagttca 


tttgctgttc 


1740 


attgaatatt 


tacaattttg 


cattaattac 


catcaatata 


gaatgggcac 


agttttttta 


1800 


agtttttttg 


tttttgtgtt 


tgtctttctt 


tttttacatt 


aatgtgtttg 


gattgtttta 


1860 


ggttccttta 


tcccttagcc 


ccctcagaat 


actattttat 


ctaattaatt 


tgtttttatt 


1920 


ttctgatatt 


taccaattgc 


tttttctttt 


ggatatttat 


aatagcatcc 


cctaataatt 


1980 


aatatacaac 


tgtttcatat 


atatacgtgt 


atgtcctgta 


gtggtggaaa 

ZJ ZJ ZJ ZJ zj 


ctggagtcaa 


2040 


catttgtatt 


aatgtgtaca 


agaaagcagt 


gttaatgcta 


ctattataat 


ttttgaggtg 


2100 


caaatcaaga 


ggttggcagc 


tttcttatgg 


ctatgaccgt 


gaatgaaggc 


ttgtaaacca 


2160 


cgtaataaac 


aaaagccaac 


aagttttttt 


agagccttta 


acaacatacg 


caatgagagt 


2220 


gattgcaata 


ctacaagata 


tagcccaaaa 


aattgaatgc 


atttcaacaa 


caac 


2274 



(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) topology: linear 
(ii) molecule type: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Ala Ser Ser lie Leu Arg Ser Lys lie lie Gin Lys Pro Tyr 

5 10 15 

Gin Leu Phe His Tyr Tyr Phe Leu Ser Glu Lys Ala Pro Gly Ser 

20 25 30 

Thr val Ser Asp Leu Asn Phe Asp Thr Asn lie Gin Thr Ser Leu 

35 40 45 

Arg Lys Leu Lys His His His Trp Thr val Gly Glu lie Phe His 

50 55 60 

Tyr Gly Phe Leu val Ser lie Leu Phe Phe val Phe val Val Phe 

65 70 75 

Pro Ala Ser Phe Phe lie Lys Leu Pro lie lie Leu Ala Phe Ala 

80 85 90 

Thr Cys Phe Leu lie Pro Leu Thr Ser Gin Phe Phe Leu Pro Ala 

95 100 105 

Leu Pro Val Phe Thr Trp Leu Ala Leu Tyr Phe Thr Cys Ala Lys 

110 115 120 

lie Pro Gin Glu Trp Lys Pro Ala lie Thr Val Lys Val Leu Pro 

125 130 135 

Ala Met Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asn Val Leu 

140 145 150 

Ala Thr lie Thr Thr Gly val Leu Asp lie Leu Ala Trp Leu Pro 

155 160 165 

Tyr Gly lie lie His Phe Ser Phe Pro Phe Val Leu Ala Ala lie 

170 175 180 

lie Phe Leu Phe Gly Pro Pro Thr Ala Leu Arg Ser Phe Gly Phe 

185 190 195 

Ala Phe Gly Tyr Met Asn Leu Leu Gly Val Leu lie Gin Met Ala 
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200 205 210 

Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Leu His Gly Leu Glu 

215 220 225 

Pro Ala Asn Tyr ser Met His Gly Ser Pro Gly Gly Leu Gly Arg 

230 235 240 

lie Asp Lys Leu Leu Gly val Asp Met Tyr Thr Thr Gly Phe Ser 

245 250 255 

Asn ser Ser lie lie Phe Gly Ala Phe Pro Ser Leu His Ser Gly 

260 265 270 

cys cys lie Met Glu val Leu Phe Leu Cys Trp Leu Phe Pro Arg 

275 280 285 

Phe Lys Phe Val Trp val Thr Tyr Ala Ser Trp Leu Trp Trp Ser 

290 295 300 

Thr Met Tyr Leu Thr His His Tyr Phe val Asp Leu lie Gly Gly 

305 310 315 

Ala Met Leu Ser Leu Thr val Phe Glu Phe Thr Lys Tyr Lys Tyr 

320 325 330 

Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Arg Trp Ser Tyr Thr 

335 340 345 

Glu lie Glu Lys lie Asp lie Gin Glu lie Asp Pro Leu ser Tyr 

350 355 360 

Asn Tyr lie Pro val Asn ser Asn Asp Asn Glu Ser Arg Leu Tyr 

365 370 375 

Thr Arg val Tyr Gin Glu Ser Gin val Ser Pro Pro Gin Arg Ala 

380 385 390 

Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Phe Ser Arg Ser Arg 

395 400 405 

Gin Ser Ser Lys Thr Gin val Pro Leu Ser Asn Leu Thr Asn Asn 

410 415 420 

Asp Gin val Ser Gly lie Asn Glu Glu Asp Glu Glu Glu Glu Gly 

425 430 435 

Asp Glu lie Ser ser ser Thr Pro ser val Phe Glu Asp Glu Pro 
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440 445 450 

Gin Gly Ser Thr Tyr Ala Ala Ser Ser Ala Thr Ser Val Asp Asp 

455 460 465 

Leu Asp Ser Lys Arg Asn 

470 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE type: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

tttgaaaaat ttgaatttta aaattaatcc aatggaaaaa attggtattt gtggaagaac 60 

cggtgctggt aaatcatcaa ttatgacagc attatatcga ttatcagaat tagaactggg 120 

gaaaattatt attgatgata ttgatatttc aactttgggt ttaaaagatc ttcgatcaaa 180 

attatcaatt attcctcaag atccagtatt attccgaggt tcaattcgga aaaacttgga 240 

tec 243 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Leu Lys Asn Leu Asn Phe Lys lie Asn Pro Met Glu Lys lie Gly 

5 10 15 
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lie Cys Gly Arg Thr Gly Ala Gly Lys Ser Ser lie Met Thr Ala 

20 25 30 

Leu Tyr Arg Leu Ser Glu Leu Glu Leu Gly Lys lie lie lie Asp 

35 40 45 

Asp lie Asp lie Ser Thr Leu Gly Leu Lys Asp Leu Arg Ser Lys 

50 55 60 

Leu ser lie lie Pro Gin Asp Pro Val Leu Phe Arg Gly Ser lie 

65 70 75 

Arg Lys Asn Leu Asp 

80 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1601 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 

(ii) MOLECULE TYPE: genomic dna 
(iv) ANTI-SENSE: Yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



aggaagatga 


cttgcatcaa 


agatggagga agtggtactg 


gcaggacgat 


caatcaaatc 


60 


agcagcagga 


ctaggtaacg 


gctcaggtga 


tgatgaaccc 


acggaccatt 


catgatcggt 


120 


gttagcaagt 


tccatattgt 


taagaccact 


catgaaggct 


actgcattag 


ggttttgagt 


180 


aaaagaatcc 


cttccaagta 


agtatgggct 


gccggtacga 


gccaaggagt 


tgctggtttt 


240 


ttcggaaaga 


ccatgaccgt 


ggataacaaa 


ctcgtattcc 


caacgaagga 


ttttaccagt 


300 


ttgcaactgt 


gggaggcgta 


gcttttgagc 


aaaaacgaag 


catataatag 


ctaaacacat 


360 


accgccgacc 


aaatctacaa 


agtagtggtg 


ggtaaggtac 


atagtacacc 


agcaaagcca 


420 


tagaacatat 


ccataaaagc 


agaagcggta 


tcgaggaaac 


acatgcgaaa 


ggaaaagtgc 


480 


ttccagcatg 


gcccatccag 


cgtgaagaga 


tggaaaggca 


ccaaaaacaa 


ccggagagtt 


540 


agaaaaacca 


tcagtgtaaa 


tgctagtgcc 


gaagagagca 


tcaatacggg 


ccaatccacc 


600 


aggagagcca 


cgtactgcat 


acgtggcagg 


ttctaaacca 


tacatatttt 


cataccaagg 


660 
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aggagaacag 


gggaaagcca 


tttggataag 


aacaccaaat 


aaattcatat aaccaaaagt 


720 


tcgagcccaa 


actggaagag 


ttccaggagg 


tgcaaagatg 


aaaagaataa atgaaatgat 


780 


aaaaggagcc 


gaataatgca 


tgactccata 


tggaacccag 


gccaaaatat caaggatgct 


840 


atgcgtggtt 


ttcgagagaa 


gactagaaag 


attagagcca 


taaagaatat tttcaagtgt 


900 


gggtaaaaca 


cgaacccata 


tgggtggacg 


ccagcgttct 


ggaataaacc tacaagagta 


960 


aaataaaatt 


gcccaggtga 


tgataacaat 


ggcaggaaaa 


aaaatttggc gtgttaaagg 


1020 


aacggtcaac 


gcaatggcca 


aaagacaggc 


aatgccaaat 


ttcccccaga atccaggaga 


1080 


ttcaatgaca 


atacaagcaa 


aaatcaaatt 


acctgctaga 


aacacatatt gcaaatgtgt 


1140 


ccatgaccat 


ttcgtattgc 


gtagcaaacg 


aaatgtaggc 


atagggttta agcttgtttc 


1200 


caacttgtat 


tgggatgctc 


ggttacacgc 


agcaaggcgc 


ttttttaagg tcgaaagagc 


1260 


agacattgct 


tcaaagaatt 


atcagagtaa 


aaaagggaag 


cgtacgaaaa aaatttcgta 


1320 


aggaattaac 


cggaaaacta 


aaggaaaaaa 


aaggaatttt 


tatgaaggaa agaaagtagc 


1380 


tattaaatgc 


aagtgtcaag 


cacttaaaag 


tagcgatgta 


aaatatttaa aaaaagatgg 


1440 


accgattaac 


caatgttcag 


ctcacagttg 


ccagcaatca 


gggctatttt tttatttttt 


1500 


ttataaaatt 


gctaattata 


tataatataa 


ttagtttatt 


aacttgcttt tcctcaaaaa 


1560 


accaattcga 


gaaaggaact 


tttgcagagg 


caaaaaagct 


t 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1601 bases 

(B) type: nucleic acid 
(c) strandedness: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 
(iv) anti-sense: Yes 

(xi) sequence description: seq id no: 32: 



aggaagauga cuugcaucaa agauggagga 

agcagcagga cuagguaacg gcucagguga 

guuagcaagu uccauauugu uaagaccacu 

aaaagaaucc cuuccaagua aguaugggcu 

uucggaaaga ccaugaccgu ggauaacaaa 



agugguacug gcaggacgau caaucaaauc 60 

ugaugaaccc acggaccauu caugaucggu 120 

caugaaggcu acugcauuag gguuuugagu 180 

gccgguacga gccaaggagu ugcugguuuu 240 

cucguauucc caacgaagga uuuuaccagu 300 
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uugcaacugu 


gggaggcgua 


gcuuuugagc 


aaaaacgaag 


cauauaauag 


cuaaacacau 


360 


accgccgacc 


aaaucuacaa 


aguaguggug 


gguaagguac 


auaguacacc 


agcaaagcca 


420 


uagaacauau 


ccauaaaagc 


agaagcggua 


ucgaggaaac 


acaugcgaaa 


ggaaaagugc 


480 


uuccagcaug 


gcccauccag 


cgugaagaga 


uggaaaggca 


ccaaaaacaa 


ccggagaguu 


540 


agaaaaacca 


ucaguguaaa 


ugcuagugcc 


gaagagagca 


ucaauacggg 


ccaauccacc 


600 


aggagagcca 


cguacugcau 


acguggcagg 


uucuaaacca 


uacauauuuu 


cauaccaagg 


660 


aggagaacag 


gggaaagcca 


uuuggauaag 


aacaccaaau 


aaauucauau 


aaccaaaagu 


720 


ucgagcccaa 


acuggaagag 


uuccaggagg 


ugcaaagaug 


aaaagaauaa 


augaaaugau 


780 


aaaaggagcc 


gaauaaugca 


ugacuccaua 


uggaacccag 


gccaaaauau 


caaggaugcu 


840 


augcgugguu 


uucgagagaa 


gacuagaaag 


auuagagcca 


uaaagaauau 


uuucaagugu 


900 


ggguaaaaca 


cgaacccaua 


uggguggacg 


ccagcguucu 


ggaauaaacc 


uacaagagua 


960 


aaauaaaauu 


gcccagguga 


ugauaacaau 


ggcaggaaaa 


aaaauuuggc 


guguuaaagg 


1020 


aacggucaac 


gcaauggcca 


aaagacaggc 


aaugccaaau 


uucccccaga 


auccaggaga 


1080 


uucaaugaca 


auacaagcaa 


aaaucaaauu 


accugcuaga 


aacacauauu 


gcaaaugugu 


1140 


ccaugaccau 


uucguauugc 


guagcaaacg 


aaauguaggc 


auaggguuua 


agcuuguuuc 


1200 


caacuuguau 


ugggaugcuc 


gguuacacgc 


agcaaggcgc 


uuuuuuaagg 


ucgaaagagc 


1260 


agacauugcu 


ucaaagaauu 


aucagaguaa 


aaaagggaag 


cguacgaaaa 


aaauuucgua 


1320 


aggaauuaac 


cggaaaacua 


aaggaaaaaa 


aaggaauuuu 


uaugaaggaa 


agaaaguagc 


1380 


uauuaaaugc 


aagugucaag 


cacuuaaaag 


uagcgaugua 


aaauauuuaa 


aaaaagaugg 


1440 


accgauuaac 


caauguucag 


cucacaguug 


ccagcaauca 


gggcuauuuu 


uuuauuuuuu 


1500 


uuauaaaauu 


gcuaauuaua 


uauaauauaa 


uuaguuuauu 


aacuugcuuu 


uccucaaaaa 


1560 


accaauucga 


gaaaggaacu 


uuugcagagg 


caaaaaagcu 


u 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE type: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
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i 



Cys Phe Thr Ser Ser Tyr Phe Pro Asp Asp Arg Arg 

5 10 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) molecule TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Cys Tyr Thr Ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro 

5 10 15 

Leu Ala Ala Asp 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1553 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 

(ii) MOLECULE TYPE: Genomic dna 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ttttacatat attattcacc caatatcata acaaaaacaa actgaatgat ggcatcttct 60 

attttgcgtt ccaaaataat acaaaaaccg taccaattat tccactacta ttttcttctg 120 

gagaaggctc ctggttctac agttagtgat ttgaattttg atacaaacat acaaacgagt 180 

ttacgtaaat taaagcatca tcattggacg gtgggagaaa tattccatta tgggtttttg 240 

gtttccatac tttttttcgt gtttgtggtt ttcccagctt cattttttat aaaattacca 300 

ataatcttag catttgctac ttgtttttta atacccttaa catcacaatt ttttcttcct 360 

gccttgcccg ttttcacttg gttggcatta tattttacgt gtgctaaaat acctcaagaa 420 



tggaaaccag 


ctatcacagt 


taaagtttta 


ccagctatgg 


aaacaatttt 


gtacggcgat 


480 


aatttatcaa 


atgttttggc 


aaccatcact 


accggagtgt 


tagatatatt 


ggcatggtta 


540 


ccatatggga 


ttattcattt 


cagtttccca 


tttgtacttg 


ctgctattat 


atttttattt 


600 


gggccaccga 


cggcattaag 


atcatttgga 


tttgcctttg 


gttatatgaa 


cttgcttgga 


660 


gtcttgattc 


aaatggcatt 


cccagctgct 


cctccatggt 


acaaaaactt 


gcacggatta 


720 


gaaccagcta 


attattcaat 


gcacgggtct 


cctggtggac 


ttggaaggat 


agataaattg 


780 


ttaggtgttg 


atatgtatac 


cacagggttt 


tccaattcat 


caatcatttt 


tggggcattc 


840 


ccatcgttac 


attcaggatg 


ttgtatcatg 


gaagtgttat 


ttttgtgttg 


gttgtttcca 


900 


cgattcaagt 


ttgtgtgggt 


tacatacgca 


tcttggcttt 


ggtggagcac 


gatgtatttg 


960 


acccatcact 


actttgtcga 


tttgattggt 


ggagccatgc 


tatctttgac 


tgtttttgaa 


1020 


ttcaccaaat 


ataaatattt 


gccaaaaaac 


aaagaaggcc 


ttttctgtcg 


ttggtcatac 


1080 


actgaaattg 


aaaaaatcga 


tatccaagag 


attgaccctt 


tatcatacaa 


ttatatccct 


1140 


gtcaacagca 


atgataatga 


aagcagattg 


tatacgagag 


tgtaccaaga 


gcctcaggtt 


1200 


agtcccccac 


agagagctga 


aacacctgaa 


gcatttgaga 


tgtcaaattt 


ttctaggtct 


1260 


agacaaagct 


caaagactca 


ggttccattg 


agtaatctta 


ctaacaatga 


tcaagtgcct 


1320 


ggaattaacg 


aagaggatga 


agaagaagaa 


ggcgatgaaa 


tttcgtcgag 


tactccttcg 


1380 


gtgtttgaag 


acgaaccaca 


gggtagcaca 


tatgctgcat 


cctcagctac 


atcagtagat 


1440 


gatttggatt 


ccaaaagaaa 


ttagtaaaac 


agcagtttct 


attaatttct 


ttatttcctc 


1500 


ctaattaatg 


attttatgtt 


caatacctac 


actatctgtt 


tttaatttcc 


tac 


1553 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Met Ala Ser Ser lie Leu Arg Ser Lys lie lie Gin Lys Pro 
15 10 15 

Tyr Gin Leu Phe His Tyr Tyr Phe Leu Leu Glu Lys Ala Pro Gly 




20 25 30 

Ser Thr val ser Asp Leu Asn Phe Asp Thr Asn lie Gin Thr Ser 
35 40 45 

Leu Arg Lys Leu Lys His His His Trp Thr val Gly Glu lie Phe 
50 55 60 

His Tyr Gly Phe Leu val Ser He Leu Phe Phe val Phe val val 
65 70 75 

Phe Pro Ala ser Phe Phe lie Lys Leu Pro lie lie Leu Ala Phe 
80 85 90 

Ala Thr Cys Phe Leu lie Pro Leu Thr Ser Gin Phe Phe Leu Pro 
95 100 105 

Ala Leu Pro val Phe Thr Trp Leu Ala Leu Tyr Phe Thr Cys Ala 

110 115 120 

Lys lie Pro Gin Glu Trp Lys Pro Ala lie Thr val Lys val Leu 

125 130 135 

Pro Ala Met Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asn Val 

140 145 150 

Leu Ala Thr lie Thr Thr Gly val Leu Asp lie Leu Ala Trp Leu 

155 160 165 

Pro Tyr Gly lie lie His Phe Ser Phe Pro Phe val Leu Ala Ala 

170 175 180 

He lie Phe Leu Phe Gly Pro Pro Thr Ala Leu Arg Ser Phe Gly 

185 190 195 

Phe Ala Phe Gly Tyr Met Asn Leu Leu Gly val Leu lie Gin Met 

200 205 210 

Ala Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Leu His Gly Leu 

215 220 225 

Glu Pro Ala Asn Tyr Ser Met His Gly Ser Pro Gly Gly Leu Gly 

230 235 240 

Arg lie Asp Lys Leu Leu Gly val Asp Met Tyr Thr Thr Gly Phe 

245 250 255 

Ser Asn Ser ser lie lie Phe Gly Ala Phe Pro Ser Leu His Ser 
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260 265 270 

Gly Cys Cys lie Met Glu val Leu Phe Leu cys Trp Leu Phe Pro 

275 280 285 

Arg Phe Lys Phe val Trp val Thr Tyr Ala Ser Trp Leu Trp Trp 

290 295 300 

Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu lie Gly 

305 310 315 

Gly Ala Met Leu Ser Leu Thr Val Phe Glu Phe Thr Lys Tyr Lys 

320 325 330 

Tyr Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Arg Trp Ser Tyr 

335 340 345 

Thr Glu lie Glu Lys lie Asp lie Gin Glu lie Asp Pro Leu Ser 

350 355 360 

Tyr Asn Tyr lie Pro val Asn Ser Asn Asp Asn Glu Ser Arg Leu 

365 370 375 

Tyr Thr Arg val Tyr Gin Glu Pro Gin Val Ser Pro Pro Gin Arg 

380 385 390 

Ala Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Phe ser Arg ser 

395 400 405 

Arg Gin Ser Ser Lys Thr Gin val Pro Leu Ser Asn Leu Thr Asn 

410 415 420 

Asn Asp Gin val Pro Gly lie Asn Glu Glu Asp Glu Glu Glu Glu 

425 430 435 

Gly Asp Glu lie Ser ser Ser Thr Pro Ser val Phe Glu Asp Glu 

440 445 450 

Pro Gin Gly Ser Thr Tyr Ala Ala Ser Ser Ala Thr Ser val Asp 

455 460 465 
Asp Leu Asp Ser Lys Arg Asn 

470 



INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

gactatttca ttatggggcc cc 22 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 30 bases 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (synthetic DNA) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

gttaactcga gaaagtgccc atcagtgttc 30 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(ii) MOLECULE type: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

gttaacggta cctcatcgtt acaccgttc 29 

(2) INFORMATION FOR SEQ ID NO: 40: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

gctaaacgac aatcctgac 19 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(ii) molecule type: other nucleic acid (synthetic DNA) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

cgttggccga ttcattaatg c 21 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 

15 10 15 
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Asn Cys His val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 
20 25 30 

Thr Leu Leu Lys Val Gin Lys Tyr Lys Pro Ala Leu Ser Asp Trp 
35 40 45 

val His Tyr lie Phe Leu Gly ser lie Met Leu Phe val Phe lie 
50 55 60 

Thr Asn Pro Ala Pro Trp lie Phe Lys lie Leu Phe Tyr Cys Phe 
65 70 75 

Leu Gly Thr Leu Phe lie lie Pro Ala Thr Ser Gin Phe Phe Phe 
80 85 90 

Asn Ala Leu Pro lie Leu Thr Trp val Ala Leu Tyr Phe Thr Ser 
95 100 105 

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro lie Thr Val Lys Val 

110 115 120 

Leu Pro Ala val Glu Thr He Leu Tyr Gly Asp Asn Leu Ser Asp 

125 130 135 

lie Leu Ala Thr Ser Thr Asn Ser Phe Leu Asp lie Leu Ala Trp 

140 145 150 

Leu Pro Tyr Gly Leu Phe His Phe Gly Ala Pro Phe val Val Ala 

155 160 165 

Ala lie Leu Phe val Phe Gly Pro Pro Thr val Leu Gin Gly Tyr 

170 175 180 

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly val lie Met Gin 

185 190 195 

Asn val Phe Pro Ala Ala Pro Pro Trp Tyr Lys lie Leu Tyr Gly 

200 205 210 

Leu Gin Ser Ala Asn Tyr Asp Met His Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Lys Leu Leu Gly lie Asn Met Tyr Thr Thr Cys 

230 235 240 

Phe Ser Asn Ser Ser Val lie Phe Gly Ala Phe Pro Ser Leu His 



245 



65 



250 



255 



Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe lie Ala Tyr Val Cys Trp Leu Trp 

275 280 285 

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe val Asp Leu Met 

290 295 300 

Ala Gly ser val Leu Ser Tyr Val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie val Asp Thr Ser Leu Phe Cys Arg Trp ser 

320 325 330 

Tyr Thr Ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu Ser val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser val ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser val Ser Arg Ser Ser Ala 

380 385 390 

Thr Ser lie Thr Ser Leu Gly val Lys Arg Ala 

395 400 



INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 



66 



10 



15 



Asn Cys His val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 
20 25 30 

Thr Leu Leu Lys val Gin Lys Tyr Lys Pro Ala Leu ser Asp Trp 
35 40 45 

Val His Tyr lie Phe Leu Gly Ser lie Met Leu Phe Val Phe lie 
50 55 60 

Thr Asn Pro Ala Pro Trp lie Phe Lys lie Leu Phe Tyr Cys Phe 
65 70 75 

Leu Gly Thr Leu Phe lie lie Pro Ala Thr Ser Gin Phe Phe Phe 
80 85 90 

Asn Ala Leu Pro lie Leu Thr Trp val Ala Leu Tyr Phe Thr ser 
95 100 105 

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro lie Thr val Lys Val 

110 115 120 

Leu Pro Ala val Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asp 

125 130 135 

lie Leu Ala Thr ser Thr Asn ser Phe Leu Asp lie Leu Ala Trp 

140 145 150 

Leu Pro Tyr Gly Leu Phe His Tyr Gly Ala Pro Phe val val Ala 

155 160 165 

Ala lie Leu Phe val Phe Gly Pro Pro Thr val Leu Gin Gly Tyr 

170 175 180 

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly val lie Met Gin 

185 190 195 

Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Lys lie Leu Tyr Gly 

200 205 210 

Leu Gin ser Ala Asn Tyr Asp Met His Gly Ser pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Lys Leu Leu Gly lie Asn Met Tyr Thr Thr Cys 

230 235 240 

Phe ser Asn ser Ser val lie Phe Gly Ala Phe Pro Ser Leu His 



245 



67 



250 



255 



Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe lie Ala Tyr Val Cys Trp Leu Trp 

275 280 285 

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe val Asp Leu Met 

290 295 300 

Ala Gly Ser val Leu Ser Tyr val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie val Asp Thr ser Leu Phe Cys Arg Trp ser 

320 325 330 

Tyr Thr ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu Ser val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro ser val Ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Ser Arg Ser Ser Ala 

380 385 390 

Thr ser lie Thr Ser Leu Gly val Lys Arg Ala 

395 400 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

atggcaaacc ctttttcgag atggtttcta tcagagagac ctccaaactg ccatgtagcc 60 

gatttagaaa caagtttaga tccccatcaa acgttgttga aggtgcaaaa atacaaaccc 120 
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gctttaagcg 


actgggtgca 


ttacatcttc 


ttgggatcca 


tcatgctgtt tgtgttcatt 


180 


actaatcccg 


caccttggat 


cttcaagatc 


cttttttatt 


gtttcttggg cactttattc 


240 


atcattccag 


ctacgtcaca 


gtttttcttc 


aatgccttgc 


ccatcctaac atgggtggcg 


300 


ctgtatttca 


cttcatcgta 


ctttccagat 


gaccgcaggc 


ctcctattac tgtcaaagtg 


360 


ttaccagcgg 


tggaaacaat 


tttatacggc 


gacaatttaa 


gtgatattct tgcaacatcg 


420 


acgaattcct 


ttttggacat 


tttagcatgg 


ttaccgtacg 


gactatttca ttttggggcc 


480 


ccatttgtcg 


ttgctgccat 


cttattcgta 


tttggtccac 


caactgtttt gcaaggttat 


540 


gcttttgcat 


ttggttatat 


gaacctgttt 


ggtgttatca 


tgcaaaatgt ctttccagcc 


600 


gctcccccat 


ggtataaaat 


tctctatgga 


ttgcaatcag 


ccaactatga tatgcatggc 


660 


tcgcctggtg 


gattagctag 


aattgataag 


ctactcggta 


ttaatatgta tactacatgt 


720 


ttttcaaatt 


cctccgtcat 


tttcggtgct 


tttccttcac 


tgcattccgg gtgtgctact 


780 


atggaagccc 


tgtttttctg 


ttattgtttt 


ccaaaattga agcccttgtt tattgcttat 


840 


gtttgctggt 


tatggtggtc 


aactatgtat 


ctgacacacc 


attattttgt agaccttatg 


900 


gcaggttctg 


tgctgtcata 


cgttattttc 


cagtacacaa agtacacaca tttaccaatt 


960 


gtagatacat 


ctcttttttg 


cagatggtca 


tacacttcaa 


ttgagaaata cgatatatca 


1020 


aagagtgatc 


cattggctgc 


agattcaaac 


gatatcgaaa 


gtgtcccttt gtccaacttg 


1080 


gaacttgact 


ttgatcttaa 


tatgactgat 


gaacccagtg 


taagcccttc gttatttgat 


1140 


ggatctactt 


ctgtttctcg 


ttcgtccgcc 


acgtctataa 


cgtcactagg tgtaaagagg 


1200 


gcttaa 










1206 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
.(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

atggcaaacc ctttttcgag atggtttcta tcagagagac ctccaaactg ccatgtagcc 60 

gatttagaaa caagtttaga tccccatcaa acgttgttga aggtgcaaaa atacaaaccc 120 

gctttaagcg actgggtgca ttacatcttc ttgggatcca tcatgctgtt tgtgttcatt 180 
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actaatcccg 


caccttggat 


cttcaagatc 


atcattccag 


ctacgtcaca 


gtttttcttc 


ctgtatttca 


cttcatcgta 


ctttccagat 


ttaccagcgg 


tggaaacaat 


tttatacggc 


acgaattcct 


ttttggacat 


tttagcatgg 


ccatttgtcg 


ttgctgccat 


cttattcgta 


gcttttgcat 


ttggttatat 


gaacctgttt 


gctcccccat 


ggtataaaat 


tctctatgga 


tcgcctggtg 


gattagctag 


aattgataag 


ttttcaaatt 


cctccgtcat 


tttcggtgct 


atggaagccc 


tgtttttctg 


ttattgtttt 


gtttgctggt 


tatggtggtc 


aactatgtat 


gcaggttctg 


tgctgtcata 


cgttattttc 


ataaatacat 


ctctttttta 


caaataatca 


aagagtgatc 


cattggctgc 


agattcaaac 


gaacttgact 


ttgatcttaa 


tatgactgat 


ggatctactt 


ctgtttctcg 


ttcgtccgcc 


gcttaa 







cttttttatt gtttcttggg cactttattc 240 

aatgccttgc ccatcctaac atgggtggcg 300 

gaccgcaggc ctcctattac tgtcaaagtg 360 

gacaatttaa gtgatattct tgcaacatcg 420 

ttaccgtacg gactatttca ttatggggcc 480 

tttggtccac caactgtttt gcaaggttat 540 

ggtgttatca tgcaaaatgt ctttccagcc 600 

ttgcaatcag ccaactatga tatgcatggc 660 

ctactcggta ttaatatgta tactacatgt 720 

tttccttcac tgcattccgg gtgtgctact 780 

ccaaaattga agcccttgtt tattgcttat 840 

ctgacacacc attattttgt agaccttatg 900 

cagtacacaa agtacacaca tttaccaatt 960 

tacacttcaa ttgagaaata cgatatatca 1020 

gatatcgaaa gtgtcccttt gtccaacttg 1080 

gaacccagtg taagcccttc gttatttgat 1140 

acgtctataa cgtcactagg tgtaaagagg 1200 

1206 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 

(ii) MOLECULE type: Genomic dna 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



atggcaaacc ctttttcgag atggtttcta 

gatttagaaa caagtttaga tccccatcaa 

gctttaagcg actgggtgca ttacatcttc 

actaatcccg caccttggat cttcaagatc 



tcagagagac ctccaaactg ccatgtagcc 60 

acgttgttga aggtgcaaaa atacaaaccc 120 

ttgggatcca tcatgctgtt tgtgttcatt 180 

cttttttatt gtttcttggg cactttattc 240 
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atcattccag 


ctacgtcaca 


gtttttcttc 


ctgtatttca 


cttcatcgta 


ctttccagat 


ttaccagcgg 


tggaaacaat 


tttatacggc 


acgaattcct 


ttttggacat 


tttagcatgg 


ccatttgtcg 


ttgctgccat 


cttattcgta 


gcttttgcat 


ttggttatat 


gaacctgttt 


gctcccccat 


ggtataaaat 


tctctatgga 


tcgcctggtg 


gattagctag 


aattgataag 


ttttcaaatt 


cctccgtcat 


tttcggtgct 


atggaagccc 


tgtttttctg 


ttattgtttt 


gtttgctggt 


tatggtggtc 


aactatgtat 


gcaggttctg 


tgctgtcata 


cgttattttc 


gtagatacat 


ctcttttttg 


cagatggtca 


aagagtgatc 


cattggctgc 


agattcaaac 


gaacttgact 


ttgatcttaa 


tatgactgat 


ggatctactt 


ctgtttctcg 


ttcgtccgcc 


gcttaa 







aatgccttgc ccatcctaac atgggtggcg 300 

gaccgcaggc ctcctattac tgtcaaagtg 360 

gacaatttaa gtgatattct tgcaacatcg 420 

ttaccgtacg gactatttca ttatggggcc 480 

tttggtccac caactgtttt gcaaggttat 540 

ggtgttatca tgcaaaatgt ctttccagcc 600 

ttgcaatcag ccaactatga tatgcatggc 660 

ctactcggta ttaatatgta tactacagct 720 

tttccttcac tgcattccgg gtgtgctact 780 

ccaaaattga agcccttgtt tattgcttat 840 

ctgacacacc attattttgt agaccttatg 900 

cagtacacaa agtacacaca tttaccaatt 960 

tacacttcaa ttgagaaata cgatatatca 1020 

gatatcgaaa gtgtcccttt gtccaacttg 1080 

gaacccagtg taagcccttc gttatttgat 1140 

acgtctataa cgtcactagg tgtaaagagg 1200 

1206 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 1206 base pairs 

(B) type: nucleic acid 
(c) strandedness: double 
(D) TOPOLOGY: linear 

(ii) molecule type: Genomic dna 

(xi) SEQUENCE description: SEQ ID NO: 47: 



atggcaaacc ctttttcgag atggtttcta 
gatttagaaa caagtttaga tccccatcaa 
gctttaagcg actgggtgca ttacatcttc 
actaatcccg caccttggat cttcaagatc 
atcattccag ctacgtcaca gtttttcttc 



tcagagagac ctccaaactg ccatgtagcc 60 

acgttgttga aggtgcaaaa atacaaaccc 120 

ttgggatcca tcatgctgtt tgtgttcatt 180 

cttttttatt gtttcttggg cactttattc 240 

aatgccttgc ccatcctaac atgggtggcg 300 
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■r- 



ctgtatttca 


cttcatcgta 


ctttccagat 


gaccgcaggc ctcctattac 


tgtcaaagtg 


360 


ttaccagcgg 


tggaaacaat 


tttatacggc 


gacaatttaa gtgatattct 


tgcaacatcg 


420 


acgaattcct 


ttttggacat 


tttagcatgg 


ttaccgtacg gactatttca 


ttttggggcc 


480 


ccatttgtcg 


ttgctgccat 


cttattcgta 


tttggtccac caactgtttt 


gcaaggttat 


540 


gcttttgcat 


ttggttatat 


gaacctgttt 


ggtgttatca tgcaaaatgt 


ctttccagcc 


600 


gctcccccat 


ggtataaaat 


tctctatgga 


ttgcaatcag ccaactatga 


tatgcatggc 


660 


tcgcctggtg 


gattagctag 


aattgataag 


ctactcggta ttaatatgta 


tactacagct 


720 


ttttcaaatt 


cctccgtcat 


tttcggtgct 


tttccttcac tgcattccgg 


gtgtgctact 


780 


atggaagccc 


tgtttttctg 


ttattgtttt 


ccaaaattga agcccttgtt 


tattgcttat 


840 


gtttgctggt 


tatggtggtc 


aactatgtat 


ctgacacacc attattttgt 


agaccttatg 


900 


gcaggttctg 


tgctgtcata 


cgttattttc 


cagtacacaa agtacacaca 


tttaccaatt 


960 


gtagatacat 


ctcttttttg 


cagatggtca 


tacacttcaa ttgagaaata 


cgatatatca 


1020 


aagagtgatc 


cattggctgc 


agattcaaac 


gatatcgaaa gtgtcccttt 


gtccaacttg 


1080 


gaacttgact 


ttgatcttaa 


tatgactgat 


gaacccagtg taagcccttc 


gttatttgat 


1140 


ggatctactt 


ctgtttctcg 


ttcgtccgcc 


acgtctataa cgtcactagg 


tgtaaagagg 


1200 


gcttaa 










1206 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

aatatgtata ctacatgttt ttcaaattcc 30 

(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) length: 30 bases 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

gttaactcga gaaagtgccc atcagtgttc 30 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE type: other nucleic acid (synthetic DNA) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

gttaacggta ccagaggaaa gaataacgc 29 
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